UPDF AI

Overcoming Catastrophic Forgetting During Domain Adaptation of Neural Machine Translation

Brian Thompson,Jeremy Gwinnup,2 Autores,Philipp Koehn

2019 · DOI: 10.18653/v1/N19-1209
North American Chapter of the Association for Computational Linguistics · 136 citas

TLDR

This work adapts Elastic Weight Consolidation (EWC)—a machine learning method for learning a new task without forgetting previous tasks—to mitigate the drop in general-domain performance as catastrophic forgetting of general- domain knowledge.

Resumen

Continued training is an effective method for domain adaptation in neural machine translation. However, in-domain gains from adaptation come at the expense of general-domain performance. In this work, we interpret the drop in general-domain performance as catastrophic forgetting of general-domain knowledge. To mitigate it, we adapt Elastic Weight Consolidation (EWC)—a machine learning method for learning a new task without forgetting previous tasks. Our method retains the majority of general-domain performance lost in continued training without degrading in-domain performance, outperforming the previous state-of-the-art. We also explore the full range of general-domain performance available when some in-domain degradation is acceptable.