UPDF AI

A Survey of Preference-Based Reinforcement Learning Methods

Christian Wirth,R. Akrour,G. Neumann,Johannes Fürnkranz

2017 · DBLP: journals/jmlr/WirthANF17
Journal of machine learning research · 409 Citations

TLDR

A unified framework for PbRL is provided that describes the task formally and points out the different design principles that affect the evaluation task for the human as well as the computational complexity.