Gemini 3.5 Flash vs Llama 3.3 Nemotron Super 49B V1.5

Google vs NVIDIA — benchmarks, pricing, and capabilities side by side.

•Llama 3.3 Nemotron Super 49B V1.5 is cheaper ($0.10 vs $1.50 per 1M input)
•Gemini 3.5 Flash has a larger context window (1M)

	Gemini 3.5 Flash	Llama 3.3 Nemotron Super 49B V1.5
Intelligence index	92.2	—
Developer	Google	NVIDIA
Type	Multimodal	LLM
Access	API only	Open weights
Context window	1,048,576 tokens	131,072 tokens
Input price	$1.50 / 1M	$0.10 / 1M
Output price	$9.00 / 1M	$0.40 / 1M
Speed	221 tok/s	—
Released	May 19, 2026	October 10, 2025
Parameters	—	—
Input modalities	Text, Image, Audio, Video	Text
Output modalities	Text	Text

Gemini 3.5 Flash details Llama 3.3 Nemotron Super 49B V1.5 details