akai

akaifun

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

updated a model about 1 month ago

akaifun/qwen2.5-7B-serl-iter4-step100

published a model about 1 month ago

akaifun/qwen2.5-7B-serl-iter4-step100

Organizations

None yet

Spaces (1)

Adept Fuyu 8b

Models (3)

akaifun/qwen2.5-7B-serl-iter4-step100

8B • Updated Jul 30 • 9

akaifun/llama3.2_3B_serl_iter3

3B • Updated Jul 30 • 9

akaifun/qwen2.5_7B_serl_250

8B • Updated Jul 25 • 9

Datasets (0)

None public yet