GPT-5.1-Codex vs Qwen3 VL 30B A3B Instruct
OpenAI vs Alibaba — benchmarks, pricing, and capabilities side by side.
- GPT-5.1-Codex has the higher intelligence index (88.2 vs 66.5)
- Qwen3 VL 30B A3B Instruct is cheaper ($0.13 vs $1.25 per 1M input tokens)
- GPT-5.1-Codex is faster (188 vs 123 tok/s)
- GPT-5.1-Codex has a larger context window (400K vs 262K tokens)
| | GPT-5.1-Codex | Qwen3 VL 30B A3B Instruct |
|---|---|---|
| Intelligence index | 88.2 | 66.5 |
| Developer | OpenAI | Alibaba |
| Type | Multimodal | Multimodal |
| Access | API only | Open weights |
| Context window | 400,000 tokens | 262,144 tokens |
| Input price | $1.25 / 1M | $0.13 / 1M |
| Output price | $10.00 / 1M | $0.52 / 1M |
| Speed | 188 tok/s | 123 tok/s |
| Released | November 13, 2025 | October 6, 2025 |
| Parameters | — | — |
| Input modalities | Text, Image | Text, Image |
| Output modalities | Text | Text |
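
To make the price and speed rows concrete, the sketch below estimates the cost and generation time of a single workload on each model. The prices and speeds are copied from the table; the function names and the workload size are hypothetical, and the sketch assumes the listed speed is output tokens per second and that no caching or batch discounts apply.

```python
# Prices are USD per 1M tokens and speeds are tokens per second,
# copied from the comparison table. Treating "speed" as output-token
# throughput is an assumption.
MODELS = {
    "GPT-5.1-Codex": {"input": 1.25, "output": 10.00, "speed": 188},
    "Qwen3 VL 30B A3B Instruct": {"input": 0.13, "output": 0.52, "speed": 123},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    prices = MODELS[model]
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / 1_000_000


def estimate_generation_seconds(model: str, output_tokens: int) -> float:
    """Return the time to stream the output at the listed speed (hypothetical helper)."""
    return output_tokens / MODELS[model]["speed"]


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    input_tokens, output_tokens = 2_000_000, 500_000
    for name in MODELS:
        cost = estimate_cost(name, input_tokens, output_tokens)
        seconds = estimate_generation_seconds(name, output_tokens)
        print(f"{name}: ${cost:.2f}, about {seconds / 60:.0f} min of generation")
```

Because output tokens cost far more than input tokens on GPT-5.1-Codex ($10.00 vs $1.25 per 1M), the price gap between the two models widens as a workload becomes more output-heavy.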
Shared benchmarks

| Benchmark | GPT-5.1-Codex | Qwen3 VL 30B A3B Instruct |
|---|---|---|
| AIME 2025 | 95.7 | 72.3 |
| GPQA Diamond | 86 | 69.5 |
| Humanity’s Last Exam | 23.4 | 6.4 |
| LiveCodeBench | 84.9 | 47.6 |
| MMLU-Pro | 86 | 76.4 |