AI Hub
All models
LLMOpen weights

Granite 4.1 8B

IBM

Updated May 21, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Specifications

Type
LLM
Access
Open weights
Released
April 30, 2026
Context window
131,072 tokens
Output speed
133 tok/s
Latency (TTFT)
0.47s
Input
Text
Output
Text
API pricing
$0.05 in · $0.10 out / 1M tokens
Capabilities
Function callingStructured output

Benchmarks

Reasoning

Compare Granite 4.1 8B with

See all Granite 4.1 8B alternatives →