Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy,Prithviraj Ammanabrolu,5 作者,Yejin Choi
2023 · DBLP: conf/iclr/RamamurthyABHSB23
International Conference on Learning Representations · 引用数 5
