Yihong Wu
Yihong7788
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
7 days ago
On Predictability of Reinforcement Learning Dynamics for Large Language
Models
upvoted
a
paper
7 days ago
It Takes Two: Your GRPO Is Secretly DPO
commented on
a paper
7 days ago
It Takes Two: Your GRPO Is Secretly DPO
Organizations
None yet