1 3

Yihong Wu

Yihong7788

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

On Predictability of Reinforcement Learning Dynamics for Large Language Models

upvoted a paper 4 days ago

It Takes Two: Your GRPO Is Secretly DPO

commented on a paper 4 days ago

It Takes Two: Your GRPO Is Secretly DPO

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

On Predictability of Reinforcement Learning Dynamics for Large Language Models

Paper • 2510.00553 • Published 5 days ago • 8

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published 5 days ago • 25

commented a paper 4 days ago

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published 5 days ago • 25 •

upvoted a paper 4 months ago

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

Paper • 2505.20046 • Published May 26 • 18

updated a model 5 months ago

Yihong7788/qwen2.5-2wiki-kg-sft-300

Text Generation • 8B • Updated May 11 • 1

published a model 5 months ago

Yihong7788/qwen2.5-2wiki-kg-sft-300

Text Generation • 8B • Updated May 11 • 1

updated a model 5 months ago

Yihong7788/qwen2.5-hotpotqa-sft-300

Text Generation • 8B • Updated May 10 • 4

published a model 5 months ago

Yihong7788/qwen2.5-hotpotqa-sft-300

Text Generation • 8B • Updated May 10 • 4

updated a dataset 5 months ago

Yihong7788/Hard_Question_2WIKI_Train

Updated Apr 24 • 15

published a dataset 5 months ago

Yihong7788/Hard_Question_2WIKI_Train

Updated Apr 24 • 15

updated a model 6 months ago

Yihong7788/Llama-3.2-3B-Instruct_kg_sft_1k

Text Generation • 3B • Updated Mar 26 • 3

published a model 6 months ago

Yihong7788/Llama-3.2-3B-Instruct_kg_sft_1k

Text Generation • 3B • Updated Mar 26 • 3

Yihong Wu

AI & ML interests

Recent Activity

Organizations

Yihong7788's activity