On Predictability of Reinforcement Learning Dynamics for Large Language Models • Paper • 2510.00553 • Published 5 days ago • 8
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning • Paper • 2505.20046 • Published May 26 • 18