
GPT-5.1-Codex vs Phi-4-multimodal-instruct

OpenAI vs Microsoft — benchmarks, pricing, and capabilities side by side.

  • GPT-5.1-Codex has the higher intelligence index (88.2 vs 46.2)
  • Phi-4-multimodal-instruct is cheaper ($0.05 vs $1.25 per 1M input)
  • GPT-5.1-Codex is faster (188 vs 25 tok/s)
  • GPT-5.1-Codex has a larger context window (400K vs 128K tokens)
                    GPT-5.1-Codex       Phi-4-multimodal-instruct
Intelligence index  88.2                46.2
Developer           OpenAI              Microsoft
Type                Multimodal          Multimodal
Access              API only            Open weights
Context window      400,000 tokens      128,000 tokens
Input price         $1.25 / 1M tokens   $0.05 / 1M tokens
Output price        $10.00 / 1M tokens  $0.10 / 1M tokens
Speed               188 tok/s           25 tok/s
Released            November 13, 2025   February 1, 2025

Parameters (Phi-4-multimodal-instruct): 5.6B (5,600,000,000)
Input modalities: Text, Image
Output modalities: Text
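
To put the price and speed figures above in concrete terms, here is a minimal Python sketch that estimates the cost of a workload and the time to generate a response. The prices and speeds are the ones listed on this page. The ModelPricing class, its method names, and the example workload (2M input tokens, 500K output tokens) are hypothetical, chosen only for illustration. The sketch also assumes the listed speed is output throughput and ignores latency to first token, caching discounts, and any other pricing tiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Per-model figures copied from the comparison table (illustrative helper)."""
    name: str
    input_usd_per_million: float
    output_usd_per_million: float
    output_tokens_per_second: float  # assumption: "Speed" means output throughput

    def cost_usd(self, input_tokens: int, output_tokens: int) -> float:
        # Prices are quoted per 1M tokens, so scale the token counts down.
        total = (input_tokens * self.input_usd_per_million
                 + output_tokens * self.output_usd_per_million)
        return total / 1_000_000

    def generation_seconds(self, output_tokens: int) -> float:
        # Rough lower bound: ignores queueing and time to first token.
        return output_tokens / self.output_tokens_per_second


GPT_5_1_CODEX = ModelPricing("GPT-5.1-Codex", 1.25, 10.00, 188.0)
PHI_4_MM = ModelPricing("Phi-4-multimodal-instruct", 0.05, 0.10, 25.0)

if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for model in (GPT_5_1_CODEX, PHI_4_MM):
        cost = model.cost_usd(input_tokens=2_000_000, output_tokens=500_000)
        secs = model.generation_seconds(output_tokens=1_000)
        print(f"{model.name}: ${cost:.2f}, {secs:.1f} s per 1,000 output tokens")
```

Under these assumptions the example workload comes to about $7.50 on GPT-5.1-Codex and about $0.15 on Phi-4-multimodal-instruct, while a 1,000-token response takes roughly 5.3 s versus 40 s.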

Shared benchmarks

                      GPT-5.1-Codex  Phi-4-multimodal-instruct
GPQA Diamond          86             31.5
Humanity’s Last Exam  23.4           4.4
LiveCodeBench         84.9           13.1
MMLU-Pro              86             48.5