- SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
  Paper • 2509.02479 • Published • 83
- WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
  Paper • 2509.06501 • Published • 75
- UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
  Paper • 2509.02544 • Published • 115
- Baichuan-M2: Scaling Medical Capability with Large Verifier System
  Paper • 2509.02208 • Published • 38
Henry Mayo
hal90000
AI & ML interests
I want to have it all
Recent Activity
updated a collection 4 days ago
Cool Research
updated a collection 6 days ago
Cool Research
upvoted a paper 6 days ago
Baichuan-M2: Scaling Medical Capability with Large Verifier System
Organizations
None yet