Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

Rajkumar Ramamurthy, Prithviraj Ammanabrolu, 5 Authors, Yejin Choi

2023 · DBLP: conf/iclr/RamamurthyABHSB23
International Conference on Learning Representations · 5 Citations