LLMOpen weights

Llama 3.1 Nemotron 70B Instruct

NVIDIA

Updated May 22, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Specifications

Type: LLM
Access: Open weights
Released: October 1, 2024
License: LLAMA 3 1 community license
Parameters: 70000000000
Knowledge cutoff: December 1, 2023
Output speed: 292 tok/s
Latency (TTFT): 0.24s
API pricing: $1.20 in · $1.20 out / 1M tokens

Benchmarks

Reasoning

46.5

Coding

16.9

Math

GSM8K	91.4
AIME 2025	11
MATH-500	73.3

General

MMLU	80.2
MMLU-Pro	69
Humanity’s Last Exam	4.6

Compare Llama 3.1 Nemotron 70B Instruct with

vs Sonar Reasoning Pro vs R1 1776 vs Qwen3.7 Max vs Gemini 3.5 Flash

See all Llama 3.1 Nemotron 70B Instruct alternatives →