Large Scale Distributed Deep Networks
J. Dean, G. Corrado, 9 authors, A. Ng
2012 · DBLP: conf/nips/DeanCMCDLMRSTYN12
Neural Information Processing Systems · Cited 4,128 times
TLDR
This paper considers the problem of training a deep network with billions of parameters using tens of thousands of CPU cores and develops two algorithms for large-scale distributed training, Downpour SGD and Sandblaster L-BFGS, which increase the scale and speed of deep network training.
