-
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 80 -
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Paper • 2509.06501 • Published • 72 -
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Paper • 2509.02544 • Published • 112 -
Baichuan-M2: Scaling Medical Capability with Large Verifier System
Paper • 2509.02208 • Published • 36
Henry Mayo
hal90000
·
AI & ML interests
I want to have it all
Recent Activity
updated
a collection
about 20 hours ago
Cool Research
updated
a collection
3 days ago
Cool Research
upvoted
a
paper
3 days ago
Baichuan-M2: Scaling Medical Capability with Large Verifier System
Organizations
None yet