GPT-5.1-Codex vs Qwen3 VL 30B A3B Instruct

OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

•GPT-5.1-Codex has the higher intelligence index (88.2 vs 66.5)
•Qwen3 VL 30B A3B Instruct is cheaper ($0.13 vs $1.25 per 1M input)
•GPT-5.1-Codex is faster
•GPT-5.1-Codex has a larger context window (400K)

	GPT-5.1-Codex	Qwen3 VL 30B A3B Instruct
Intelligence index	88.2	66.5
Developer	OpenAI	Alibaba
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	400,000 tokens	262,144 tokens
Input price	$1.25 / 1M	$0.13 / 1M
Output price	$10.00 / 1M	$0.52 / 1M
Speed	188 tok/s	123 tok/s
Released	November 13, 2025	October 6, 2025
Parameters	—	—
Input modalities	Text, Image	Text, Image
Output modalities	Text	Text

Shared benchmarks

GPT-5.1-Codex

Qwen3 VL 30B A3B Instruct

AIME 2025

95.7

72.3

GPQA Diamond

86

69.5

Humanity’s Last Exam

23.4

6.4

LiveCodeBench

84.9

47.6

MMLU-Pro

86

76.4

GPT-5.1-Codex details Qwen3 VL 30B A3B Instruct details