Glossary
A plain-English glossary of the methods and ideas behind modern AI.
5 terms
| Term | Definition | Date |
|---|---|---|
| Generative Adversarial Network | Two networks trained against each other: a generator that produces samples and a discriminator (sometimes called a critic) that tries to tell them apart from real data. | May 21, 2026 |
| Gradient Descent | Iteratively nudging parameters downhill to minimize a loss. | May 21, 2026 |
| Group Relative Policy Optimization (GRPO) | A critic-free RL algorithm that estimates the advantage baseline from a group of sampled responses to the same prompt. | May 21, 2026 |
| Group Sequence Policy Optimization (GSPO) | A reasoning-RL algorithm that computes importance ratios and clips at the whole-sequence level rather than per token. | May 21, 2026 |
| Grouped-Query Attention | An attention variant in which groups of query heads share a single key/value head, shrinking the KV cache and speeding up inference. | May 21, 2026 |
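
To make the GRPO entry concrete, here is a minimal sketch of a group-relative advantage: each response's reward is compared with the mean reward of the other responses sampled for the same prompt, so no learned critic is needed. The function name `group_relative_advantages`, the `eps` parameter, and the division by the group's standard deviation are assumptions for illustration; the definition above only specifies that the baseline comes from the group.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled response relative to its own group.

    rewards: one scalar reward per response sampled for the same prompt.
    eps: small constant (an assumption) that avoids division by zero
         when every response in the group gets the same reward.
    """
    baseline = mean(rewards)   # the group mean stands in for a critic's value estimate
    scale = pstdev(rewards)    # population standard deviation of the group
    return [(r - baseline) / (scale + eps) for r in rewards]


# Four hypothetical responses to one prompt, rewarded 1.0 if correct and 0.0 otherwise.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Responses that beat the group average get a positive advantage and the rest get a negative one. If every response earns the same reward, all advantages are zero and that prompt contributes no learning signal.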