QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL Paper • 2510.00967 • Published 8 days ago • 11
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper • 2505.21600 • Published May 27 • 70