Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning
C. Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar