R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning Paper • 2508.21113 • Published 10 days ago • 104
Embodied AI Collection Embodiment enables a model to interact with its environment. The key is to anticipate what change its current action could bring about. • 34 items • Updated 1 day ago • 1
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Paper • 2503.21620 • Published Mar 27 • 63