GPT-5.1-Codex vs Qwen3 VL 8B Instruct

OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

•GPT-5.1-Codex has the higher intelligence index (88.2 vs 43)
•Qwen3 VL 8B Instruct is cheaper ($0.08 vs $1.25 per 1M input)
•GPT-5.1-Codex is faster
•GPT-5.1-Codex has a larger context window (400K)

	GPT-5.1-Codex	Qwen3 VL 8B Instruct
Intelligence index	88.2	43
Developer	OpenAI	Alibaba
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	400,000 tokens	256,000 tokens
Input price	$1.25 / 1M	$0.08 / 1M
Output price	$10.00 / 1M	$0.50 / 1M
Speed	188 tok/s	145 tok/s
Released	November 13, 2025	October 14, 2025
Parameters	—	—
Input modalities	Text, Image	Image, Text
Output modalities	Text	Text

Shared benchmarks

GPT-5.1-Codex

Qwen3 VL 8B Instruct

AIME 2025

95.7

27.3

GPQA Diamond

86

42.7

Humanity’s Last Exam

23.4

2.9

LiveCodeBench

84.9

33.2

MMLU-Pro

86

68.6

GPT-5.1-Codex details Qwen3 VL 8B Instruct details