FLOPs
Floating-point operations — the standard unit for measuring the compute used to train or run a model.
Training compute is often quoted in total FLOPs (e.g. 10^25), combining model size, dataset size, and training steps. It is a key input to scaling laws and a common axis for comparing how much was invested in a model.