Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

TLDR

Text2Reward is a data-free framework that uses large language models (LLMs) to automate the generation of dense reward functions. It produces interpretable, free-form dense reward code that covers a wide range of tasks, utilizes existing packages, and allows iterative refinement with human feedback.
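
To make "dense reward code" concrete, here is a minimal sketch of the kind of function such a framework might emit for a reaching task. It is illustrative only and not taken from the paper: the function name `dense_reach_reward`, its parameters, the tanh shaping, and the constants are all assumptions.

```python
import numpy as np


def dense_reach_reward(ee_pos, goal_pos, action, action_penalty=0.01):
    """Hypothetical dense reward for a reaching task (illustrative only).

    ee_pos, goal_pos: 3D positions of the end effector and the goal.
    action: the control vector applied at this step.
    """
    ee_pos = np.asarray(ee_pos, dtype=float)
    goal_pos = np.asarray(goal_pos, dtype=float)
    action = np.asarray(action, dtype=float)

    # Shaped distance term: equals 1 at the goal and decays smoothly toward 0
    # as the end effector moves away, so every step gives a learning signal.
    distance = np.linalg.norm(ee_pos - goal_pos)
    reach_reward = 1.0 - np.tanh(5.0 * distance)

    # Small quadratic penalty on the action to discourage jerky control.
    control_cost = action_penalty * float(np.sum(action ** 2))

    return float(reach_reward - control_cost)
```

Unlike a sparse reward that is nonzero only on success, this function returns a graded value at every step. Because it is plain code, a human can read it and adjust a term (for example the penalty weight) in response to feedback.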