Hacker News
Does RL Incentivize Reasoning in LLMs Beyond the Base Model? (limit-of-rlvr.github.io)
84 points by leodriesch on April 22, 2025 | 38 comments
