GPT-5.1 vs Qwen3 4B 2507 Instruct

OpenAI vs Alibaba: benchmarks, pricing, and capabilities side by side.

• GPT-5.1 has the higher intelligence index (89 vs 52.2)
• Qwen3 4B 2507 Instruct is cheaper ($0.00 vs $1.25 per 1M input tokens)

	GPT-5.1	Qwen3 4B 2507 Instruct
Intelligence index	89	52.2
Developer	OpenAI	Alibaba
Type	LLM	LLM
Access	API only	—
Context window	400,000 tokens	—
Input price	$1.25 / 1M	$0.00 / 1M
Output price	$10.00 / 1M	$0.00 / 1M
Speed	115 tok/s	—
Released	November 12, 2025	August 6, 2025
Parameters	—	—
Input modalities	Text, Image	—
Output modalities	Text	—
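
The per-1M-token prices in the table can be turned into a per-request estimate. The following is a minimal sketch, assuming cost scales linearly with token count and using the listed prices as-is. The function name `request_cost` and the token counts in the usage line are illustrative, not taken from either provider's API. The $0.00 figures are copied from the table; any self-hosting or infrastructure cost is not modeled.

```python
# Listed prices in USD per 1M tokens, copied from the comparison table.
PRICES_PER_MILLION = {
    "GPT-5.1": {"input": 1.25, "output": 10.00},
    "Qwen3 4B 2507 Instruct": {"input": 0.00, "output": 0.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from the listed per-1M prices."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${request_cost(name, 200_000, 50_000):.2f}")
```

For that hypothetical workload, the GPT-5.1 estimate comes to $0.75 (200,000 input tokens at $1.25 / 1M plus 50,000 output tokens at $10.00 / 1M), while the Qwen3 4B 2507 Instruct estimate is $0.00 at the listed prices.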

Shared benchmarks

	GPT-5.1	Qwen3 4B 2507 Instruct
AIME 2025	94	52.3
GPQA Diamond	88.1	51.7
Humanity’s Last Exam	26.5	4.7
LiveCodeBench	86.8	37.7
MMLU-Pro	87	67.2
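
The shared benchmark scores above can be compared by their point difference. This is a small sketch that hard-codes the five listed scores and ranks the benchmarks by the gap between the two models. The helper name `score_gaps` is an illustrative choice, and a raw point difference is only one possible way to compare scores.

```python
# Shared benchmark scores from the listing: (GPT-5.1, Qwen3 4B 2507 Instruct).
SCORES = {
    "AIME 2025": (94.0, 52.3),
    "GPQA Diamond": (88.1, 51.7),
    "Humanity's Last Exam": (26.5, 4.7),
    "LiveCodeBench": (86.8, 37.7),
    "MMLU-Pro": (87.0, 67.2),
}


def score_gaps(scores: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Return the GPT-5.1 minus Qwen3 point difference for each benchmark."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


if __name__ == "__main__":
    gaps = score_gaps(SCORES)
    # Print benchmarks from the widest gap to the narrowest.
    for name, gap in sorted(gaps.items(), key=lambda item: item[1], reverse=True):
        print(f"{name}: {gap:+.1f} points")
```

On these numbers the widest gap is on LiveCodeBench (49.1 points) and the narrowest is on MMLU-Pro (19.8 points).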
