GPT-5 Codex vs Qwen3 VL 235B A22B Instruct

OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

•GPT-5 Codex has the higher intelligence index (87.1 vs 70.9)
•Qwen3 VL 235B A22B Instruct is cheaper ($0.20 vs $1.25 per 1M input)
•GPT-5 Codex is faster
•GPT-5 Codex has a larger context window (400K)

	GPT-5 Codex	Qwen3 VL 235B A22B Instruct
Intelligence index	87.1	70.9
Developer	OpenAI	Alibaba
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	400,000 tokens	262,144 tokens
Input price	$1.25 / 1M	$0.20 / 1M
Output price	$10.00 / 1M	$0.88 / 1M
Speed	180 tok/s	51 tok/s
Released	September 23, 2025	September 23, 2025
Parameters	—	—
Input modalities	Text, Image	Text, Image
Output modalities	Text	Text

Shared benchmarks

GPT-5 Codex

Qwen3 VL 235B A22B Instruct

AIME 2025

98.7

70.7

GPQA Diamond

83.7

71.2

Humanity’s Last Exam

25.6

6.3

LiveCodeBench

84

59.4

MMLU-Pro

86.5

82.3

GPT-5 Codex details Qwen3 VL 235B A22B Instruct details