Papers - a TheOneTrueNiz Collection

Updated 3 days ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published 9 days ago • 103

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published 14 days ago • 22

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published 9 days ago • 72

UItron: Foundational GUI Agent with Advanced Perception and Planning

Paper • 2508.21767 • Published 8 days ago • 12

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 4 days ago • 76