SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper • 2509.02479 • Published 9 days ago • 80
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Paper • 2509.06501 • Published 3 days ago • 71
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published 9 days ago • 112
Baichuan-M2: Scaling Medical Capability with Large Verifier System Paper • 2509.02208 • Published 9 days ago • 36