This is a survey of the main principles and difficulties of batch normalization. Its implementations in Caffe2 and TensorFlow are also covered. Related: #3658