Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour - Facebook Research
In this paper, we empirically show that on the ImageNet dataset, large minibatches cause optimization difficulties, but when these are addressed, the trained networks exhibit good generalization.
8 Jun 2017