Cool Research - a hal90000 Collection

hal90000 's Collections

Cool Research

updated 1 day ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 9 days ago • 80
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published 3 days ago • 71
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 9 days ago • 112
Baichuan-M2: Scaling Medical Capability with Large Verifier System

Paper • 2509.02208 • Published 9 days ago • 36