Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
akai
akaifun
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
updated
a model
about 1 month ago
akaifun/qwen2.5-7B-serl-iter4-step100
published
a model
about 1 month ago
akaifun/qwen2.5-7B-serl-iter4-step100
View all activity
Organizations
None yet
spaces
1
Runtime error
Adept Fuyu 8b
🐠
models
3
Sort: Recently updated
akaifun/qwen2.5-7B-serl-iter4-step100
8B
•
Updated
Jul 30
•
9
akaifun/llama3.2_3B_serl_iter3
3B
•
Updated
Jul 30
•
9
akaifun/qwen2.5_7B_serl_250
8B
•
Updated
Jul 25
•
9
datasets
0
None public yet