A comprehensive survey on safe reinforcement learning
A comprehensive survey on safe reinforcement learning
Javier García,F. Fernández
2015 · DOI: 10.5555/2789272.2886795
Journal of machine learning research · 引用 1,719 次
TLDR
This work categorize and analyze two approaches of Safe Reinforcement Learning, based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor and the incorporation of external knowledge or the guidance of a risk metric.
