Adam Optimizer
A widely used adaptive optimizer for training neural networks.
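Adam (adaptive moment estimation) combines momentum with per-parameter adaptive learning rates. It keeps exponential moving averages of each parameter's gradient and squared gradient, and divides the first by the square root of the second to scale each update. This makes the step size largely insensitive to the overall scale of the gradients, so Adam often works reasonably well with its default hyperparameters. Adam and its variants, such as AdamW (which decouples weight decay from the gradient-based update), are common default optimizers for training deep-learning models.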
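
The update rule can be illustrated with a minimal NumPy sketch. This is not any library's implementation: the function name adam_step and the toy objective are hypothetical, chosen only for illustration, and the default hyperparameters (lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8) are the commonly cited ones.

```python
import numpy as np

def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update (illustrative sketch). t is the 1-based step count."""
    # Exponential moving average of the gradient (the momentum term).
    m = beta1 * m + (1 - beta1) * grad
    # Exponential moving average of the squared gradient (per-parameter scale).
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction: m and v start at zero, so early averages are too small.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Each parameter gets its own effective step size lr / (sqrt(v_hat) + eps).
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v

# Toy usage (assumed example): minimize f(x) = sum(x**2), whose gradient is 2*x.
x = np.array([1.0, -2.0])
m = np.zeros_like(x)
v = np.zeros_like(x)
for t in range(1, 2001):
    grad = 2 * x
    x, m, v = adam_step(x, grad, m, v, t, lr=0.01)
```

On the very first step the bias-corrected ratio is approximately the sign of the gradient, so each parameter moves by roughly lr regardless of how large its gradient is. This is the sense in which the per-parameter learning rate adapts to gradient scale.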