Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
2
XUE Boyang
BeyondHsueh
Follow
Henrywang's profile picture
1 follower
·
0 following
https://amourwaltz.github.io/
AmourWaltz
AI & ML interests
Reliable LLM
Recent Activity
upvoted
a
paper
7 days ago
OTC: Optimal Tool Calls via Reinforcement Learning
authored
a paper
27 days ago
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
published
a model
3 months ago
BeyondHsueh/Qwen2.5-1.5B-Open-R1-GRPO
View all activity
Organizations
None yet
Papers
1
arxiv:
2503.24377
models
2
Sort: Recently updated
BeyondHsueh/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Feb 5
BeyondHsueh/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
Feb 1
•
8
datasets
0
None public yet