#tsinghua_university — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #tsinghua_university, aggregated by home.social.
-
Does RL Incentivize Reasoning in LLMs Beyond the Base Model?
https://limit-of-rlvr.github.io/
#ycombinator #Qwen #Deepseek_R1 #PPO #GRPO #AIME #RLVR #Tsinghua_University