JM-Brun's Collections
RL
Diffusion models
Prompt Optimization
Tool calling
Tabular
Multimodal
Agents
Attribution
SLMs
LLM-as-a-judge
LLM Training
LLM-KG
Research Tool
LLM Architecture
LLM Data
World model
Reasonning
LLM Math
Interpretability XAI
Hallucinations

RL

updated 10 days ago

  • A Survey of Reinforcement Learning for Large Reasoning Models

    Paper • 2509.08827 • Published 12 days ago • 163