AI Hub
← All models

GPT-5.1 vs Qwen3 4B 2507 Instruct

OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • GPT-5.1 has the higher intelligence index (89 vs 52.2)
  • Qwen3 4B 2507 Instruct is cheaper ($0.00 vs $1.25 per 1M input)
GPT-5.1Qwen3 4B 2507 Instruct
Intelligence index8952.2
DeveloperOpenAIAlibaba
TypeLLMLLM
AccessAPI only
Context window400,000 tokens
Input price$1.25 / 1M$0.00 / 1M
Output price$10.00 / 1M$0.00 / 1M
Speed115 tok/s
ReleasedNovember 12, 2025August 6, 2025
Parameters
Input modalitiesText, Image
Output modalitiesText

Shared benchmarks

GPT-5.1
Qwen3 4B 2507 Instruct
AIME 2025
94
52.3
GPQA Diamond
88.1
51.7
Humanity’s Last Exam
26.5
4.7
LiveCodeBench
86.8
37.7
MMLU-Pro
87
67.2