UPDF AI

A comprehensive survey on safe reinforcement learning

Javier García,F. Fernández

2015 · DOI: 10.5555/2789272.2886795
Journal of machine learning research · 1,719회 인용

TLDR

This work categorize and analyze two approaches of Safe Reinforcement Learning, based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor and the incorporation of external knowledge or the guidance of a risk metric.