akai's picture

2

akai

akaifun

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

updated a model about 1 month ago

akaifun/qwen2.5-7B-serl-iter4-step100

published a model about 1 month ago

akaifun/qwen2.5-7B-serl-iter4-step100

View all activity

Organizations

None yet

upvoted a paper 11 days ago

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published 14 days ago • 22

upvoted a paper 3 months ago

OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation

Paper • 2506.02397 • Published Jun 3 • 35