
GPT-5 Codex vs Qwen3 VL 235B A22B Instruct

OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • GPT-5 Codex has the higher intelligence index (87.1 vs 70.9)
  • Qwen3 VL 235B A22B Instruct is cheaper ($0.20 vs $1.25 per 1M input)
  • GPT-5 Codex is faster (180 vs 51 tok/s)
  • GPT-5 Codex has a larger context window (400,000 vs 262,144 tokens)
                    GPT-5 Codex         Qwen3 VL 235B A22B Instruct
Intelligence index  87.1                70.9
Developer           OpenAI              Alibaba
Type                Multimodal          Multimodal
Access              API only            Open weights
Context window      400,000 tokens      262,144 tokens
Input price         $1.25 / 1M tokens   $0.20 / 1M tokens
Output price        $10.00 / 1M tokens  $0.88 / 1M tokens
Speed               180 tok/s           51 tok/s
Released            September 23, 2025  September 23, 2025
Parameters          (not listed)        (not listed)
Input modalities    Text, Image         Text, Image
Output modalities   Text                Text
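
The prices and speeds above can be turned into a rough per-workload estimate. The following is a minimal sketch, not an official calculator: it assumes the listed prices apply flat per token (no caching discounts, batch pricing, or provider markup) and that the listed speed is sustained output throughput, ignoring time to first token. The function names and the workload size are hypothetical.

```python
# Prices (USD per 1M tokens) and speeds (tokens/second) copied from the table above.
MODELS = {
    "GPT-5 Codex": {"input": 1.25, "output": 10.00, "speed": 180},
    "Qwen3 VL 235B A22B Instruct": {"input": 0.20, "output": 0.88, "speed": 51},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD, assuming flat list prices."""
    price = MODELS[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def estimate_generation_seconds(model: str, output_tokens: int) -> float:
    """Return the estimated time to emit output_tokens at the listed speed."""
    return output_tokens / MODELS[model]["speed"]


if __name__ == "__main__":
    # Hypothetical workload: 1M input tokens and 100K output tokens.
    for name in MODELS:
        cost = estimate_cost(name, 1_000_000, 100_000)
        seconds = estimate_generation_seconds(name, 100_000)
        print(f"{name}: ${cost:.2f}, about {seconds:.0f} s of generation")
```

For that workload the estimate comes to about $2.25 for GPT-5 Codex and about $0.29 for Qwen3 VL 235B A22B Instruct, so the price gap is driven mostly by output tokens.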

Shared benchmarks

                      GPT-5 Codex  Qwen3 VL 235B A22B Instruct
AIME 2025             98.7         70.7
GPQA Diamond          83.7         71.2
Humanity’s Last Exam  25.6         6.3
LiveCodeBench         84           59.4
MMLU-Pro              86.5         82.3