Learning long-term dependencies with gradient descent is difficult
Yoshua Bengio, P. Simard, P. Frasconi
1994 · DOI: 10.1109/72.279181
8,963 citations
TLDR
This work shows why gradient-based learning algorithms face an increasingly difficult problem as the duration of the dependencies to be captured increases, and exposes a trade-off between efficient learning by gradient descent and latching on information for long periods.
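
The trade-off described in the summary can be illustrated with a toy example. The sketch below is not taken from the paper: it assumes a single-unit recurrence h_t = tanh(w * h_{t-1}) with no input, and the function name `gradient_through_time` and the values of `w` and `h0` are hypothetical choices made for illustration. It tracks |dh_T/dh_0|, the sensitivity of the final state to the initial state, as the number of time steps T grows.

```python
import math


def gradient_through_time(w, h0, steps):
    """Return [|d h_T / d h_0| for T = 1..steps] for h_t = tanh(w * h_{t-1}).

    Toy scalar recurrence chosen for illustration (an assumption, not the
    paper's general setting).
    """
    h = h0
    grad = 1.0
    history = []
    for _ in range(steps):
        h = math.tanh(w * h)
        # Chain rule: d tanh(w * h_prev) / d h_prev = w * (1 - h_t ** 2)
        grad *= w * (1.0 - h * h)
        history.append(abs(grad))
    return history


if __name__ == "__main__":
    # w = 0.5: the state decays toward 0, so nothing is latched.
    # w = 2.0: the state settles at a nonzero fixed point (it "latches"),
    #          but tanh saturates there and each step's derivative is < 1.
    for w in (0.5, 2.0):
        grads = gradient_through_time(w, h0=0.5, steps=50)
        print(f"w={w}: T=1 -> {grads[0]:.3e}, T=10 -> {grads[9]:.3e}, "
              f"T=50 -> {grads[49]:.3e}")
```

In this toy setting the sensitivity shrinks at every step for both weights: with w = 0.5 the unit forgets its state, and with w = 2.0 it holds a state only by saturating, which again drives the per-step derivative below 1. Under these assumptions, that is the one-dimensional version of the tension between latching information and propagating useful gradients.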
