Llama 4 Scout

Meta

Updated May 21, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

Specifications

Type: Multimodal
Access: Open weights
Released: April 5, 2025
License: Llama 4 Community License
Parameters: 109B total / 17B active (MoE)
Context window: 10,000,000 tokens
Knowledge cutoff: August 31, 2024
Output speed: 776 tok/s
Latency (TTFT): 0.31s
Input: Text, Image
Output: Text
API pricing: $0.08 input · $0.30 output per 1M tokens
Capabilities: Function calling, Structured output
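
To make the pricing and speed figures above concrete, here is a minimal Python sketch that turns them into a per-request cost and a rough response-time estimate. The constants are copied from the specifications; the function names are hypothetical, and the timing formula assumes the listed TTFT and output speed hold for every request, which in practice varies with provider, load, and prompt length.

```python
# Figures copied from the specifications above.
INPUT_USD_PER_MTOK = 0.08     # USD per 1M input tokens
OUTPUT_USD_PER_MTOK = 0.30    # USD per 1M output tokens
OUTPUT_TOKENS_PER_SEC = 776   # listed output speed
TTFT_SEC = 0.31               # listed time to first token


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated API cost in USD for one request (hypothetical helper)."""
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


def estimate_response_sec(output_tokens: int) -> float:
    """Rough wall-clock estimate: TTFT plus streaming time.

    Assumption: TTFT is constant, which is a simplification; long
    prompts usually increase it.
    """
    return TTFT_SEC + output_tokens / OUTPUT_TOKENS_PER_SEC


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 1,000-token answer.
    print(f"cost: ${estimate_cost_usd(200_000, 1_000):.4f}")
    print(f"time: {estimate_response_sec(1_000):.2f}s")
```

Under these assumptions, filling the entire 10,000,000-token context window once would cost about $0.80 in input tokens alone.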

Benchmarks

[Benchmark charts not reproduced. Categories: Reasoning, Coding, Math, Multimodal]

Our take

The smaller Llama 4 model, notable for its industry-leading 10M-token context window and for fitting on a single GPU node.
