UPDF AI

Learning long-term dependencies with gradient descent is difficult

Yoshua Bengio,P. Simard,P. Frasconi

1994 · DOI: 10.1109/72.279181
8,963 citaten

TLDR

This work shows why gradient based learning algorithms face an increasingly difficult problem as the duration of the dependencies to be captured increases, and exposes a trade-off between efficient learning by gradient descent and latching on information for long periods.