AI Hub
All benchmarks
Math

FrontierMath

A benchmark of hundreds of original, exceptionally challenging mathematics problems crafted and vetted by expert mathematicians, covering most major branches of modern mathematics

6Models
26.3Top score
9.6Median

A benchmark of hundreds of original, exceptionally challenging mathematics problems crafted and vetted by expert mathematicians, covering most major branches of modern mathematics from number theory and real analysis to algebraic geometry and category theory.

State of the art over time

Each point is a model at its release date; the line traces the best score to date.

3023158020242025GPT-5 mini: 22.1 (2025-08-07)GPT-5 nano: 9.6 (2025-08-07)o1: 5.5 (2024-12-05)o1o3-mini: 9.2 (2025-01-31)o3-minio3: 15.8 (2025-04-16)o3GPT-5: 26.3 (2025-08-07)GPT-5

Ranking

1GPT-5
26.3
2GPT-5 mini
22.1
3o3
15.8
4GPT-5 nano
9.6
5o3-mini
9.2
6o1
5.5

Related Math benchmarks