A Survey of Reinforcement Learning for Large Reasoning Models • Paper • 2509.08827 • Published 16 days ago • 166
Reverse-Engineered Reasoning for Open-Ended Generation • Paper • 2509.06160 • Published 19 days ago • 144
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning • Paper • 2509.08755 • Published 16 days ago • 54