DeepSeek-V4-Flash vs Llama 3.1 Nemotron Ultra 253B v1

DeepSeek vs NVIDIA — benchmarks, pricing, and capabilities side by side.

  • DeepSeek-V4-Flash has the higher intelligence index (89.4 vs 78.3)
  • DeepSeek-V4-Flash is cheaper ($0.10 vs $0.60 per 1M input tokens; $0.20 vs $1.80 per 1M output tokens)
  • DeepSeek-V4-Flash is faster (109 vs 42 tok/s)
Attribute           DeepSeek-V4-Flash    Llama 3.1 Nemotron Ultra 253B v1
Intelligence index  89.4                 78.3
Developer           DeepSeek             NVIDIA
Type                LLM                  LLM
Access              Open weights         Open weights
Context window      1,048,576 tokens
Input price         $0.10 / 1M tokens    $0.60 / 1M tokens
Output price        $0.20 / 1M tokens    $1.80 / 1M tokens
Speed               109 tok/s            42 tok/s
Released            April 24, 2026       April 7, 2025
Parameters          284B (13B active)    253B
Input modalities    Text
Output modalities   Text
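
To make the price and speed figures above concrete, here is a minimal Python sketch that estimates cost and generation time for a workload. The prices and speeds are taken from the table; everything else is an assumption for illustration: the names `ModelSpec` and `estimate` are hypothetical, the 2M-input / 500K-output workload is made up, and the listed speed is assumed to be output tokens per second for a single sequential stream.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Listed price and speed figures for one model (hypothetical helper type)."""
    name: str
    input_usd_per_1m: float   # USD per 1M input tokens
    output_usd_per_1m: float  # USD per 1M output tokens
    tokens_per_second: float  # assumed to mean output tokens per second


# Figures copied from the comparison table above.
DEEPSEEK_V4_FLASH = ModelSpec("DeepSeek-V4-Flash", 0.10, 0.20, 109.0)
NEMOTRON_ULTRA = ModelSpec("Llama 3.1 Nemotron Ultra 253B v1", 0.60, 1.80, 42.0)


def estimate(model: ModelSpec, input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (cost in USD, generation time in seconds) for a workload.

    Time covers only output generation at the listed speed; prompt
    processing, batching, and network latency are ignored.
    """
    cost = (
        input_tokens * model.input_usd_per_1m
        + output_tokens * model.output_usd_per_1m
    ) / 1_000_000
    seconds = output_tokens / model.tokens_per_second
    return cost, seconds


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens, 500K output tokens.
    for spec in (DEEPSEEK_V4_FLASH, NEMOTRON_ULTRA):
        usd, secs = estimate(spec, 2_000_000, 500_000)
        print(f"{spec.name}: ${usd:.2f}, about {secs / 60:.0f} min of generation")
```

For this assumed workload the listed prices work out to roughly $0.30 for DeepSeek-V4-Flash versus $2.10 for Llama 3.1 Nemotron Ultra 253B v1. The gap is wider than the 6x input-price ratio alone suggests because the output-price ratio is 9x.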

Shared benchmarks

Benchmark             DeepSeek-V4-Flash    Llama 3.1 Nemotron Ultra 253B v1
GPQA Diamond          89.4                 76
Humanity’s Last Exam  32.1                 8.1