Understanding the difficulty of training deep feedforward neural networks
Understanding the difficulty of training deep feedforward neural networks
Xavier Glorot,Yoshua Bengio
2010 · DBLP: journals/jmlr/GlorotB10
International Conference on Artificial Intelligence and Statistics · 19,134 Citações
TLDR
The objective here is to understand better why standard gradient descent from random initialization is doing so poorly with deep neural networks, to better understand these recent relative successes and help design better algorithms in the future.
