MATH
Competition mathematics problems requiring multi-step symbolic reasoning.
67Models
97.9Top score
70.6Median
State of the art over time
Each point is a model at its release date; the line traces the best score to date.
Competition mathematics problems requiring multi-step symbolic reasoning.
Each point is a model at its release date; the line traces the best score to date.