Training

Batch Normalization: Accelerating Deep Network Training

Google·February 11, 2015

Sergey Ioffe, Christian Szegedy

TL;DR

Normalizes layer activations per mini-batch, dramatically stabilizing and speeding up training.

A key technique that made very deep networks practical to train.