TheOneTrueNiz's Collections

Papers

updated 3 days ago

  • R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

    Paper • 2508.21113 • Published 9 days ago • 103

  • Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

    Paper • 2508.16949 • Published 14 days ago • 22

  • EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

    Paper • 2508.21112 • Published 9 days ago • 72

  • UItron: Foundational GUI Agent with Advanced Perception and Planning

    Paper • 2508.21767 • Published 8 days ago • 12

  • SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

    Paper • 2509.02479 • Published 4 days ago • 76