A Survey of Preference-Based Reinforcement Learning Methods
A Survey of Preference-Based Reinforcement Learning Methods
Christian Wirth,R. Akrour,G. Neumann,Johannes Fürnkranz
2017 · DBLP: journals/jmlr/WirthANF17
Journal of machine learning research · 409 Citations
TLDR
A unified framework for PbRL is provided that describes the task formally and points out the different design principles that affect the evaluation task for the human as well as the computational complexity.
