UPDF AI

Efficient Reductions for Imitation Learning

Stéphane Ross,Drew Bagnell

2010 · DBLP: journals/jmlr/RossB10
International Conference on Artificial Intelligence and Statistics · 962 件の引用

TLDR

This work proposes two alternative algorithms for imitation learning where training occurs over several episodes of interaction and shows that this leads to stronger performance guarantees and improved performance on two challenging problems: training a learner to play a 3D racing game and Mario Bros.