kaicheng001
kaicheng001
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted
a
paper
about 1 month ago
Intern-S1: A Scientific Multimodal Foundation Model
upvoted
a
paper
about 2 months ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification
Organizations
None yet