GuoLiangTang
Tommy930
AI & ML interests
LLM, NLP, ML
Recent Activity
upvoted a paper 20 minutes ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
upvoted a paper 20 minutes ago
Rethinking Thinking Tokens: LLMs as Improvement Operators
Organizations
None yet