UPDF AI

Bleu: a Method for Automatic Evaluation of Machine Translation

K. Papineni,Salim Roukos,T. Ward,Wei-Jing Zhu

2002 · DOI: 10.3115/1073083.1073135
Annual Meeting of the Association for Computational Linguistics · 引用 29,758 次

TLDR

This work proposes a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

摘要

Human evaluations of machine translation are extensive but expensive. Human evaluations can take months to finish and involve human labor that can not be reused. We propose a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run. We present this method as an automated understudy to skilled human judges which substitutes for them when there is need for quick or frequent evaluations.