Chuanyang Jin

Chuanyang-Jin

https://chuanyangjin.com

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

upvoted a paper 2 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

upvoted a paper 3 days ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Organizations

upvoted 2 papers 2 days ago

SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

Paper • 2506.23046 • Published Jun 29 • 1

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published 3 days ago • 33

upvoted 2 papers 3 days ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published 11 days ago • 52

Agent Learning via Early Experience

Paper • 2510.08558 • Published 3 days ago • 172

upvoted a paper 12 days ago

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published 12 days ago • 17

commented on a paper 12 days ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published 13 days ago • 18

authored 2 papers 12 days ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published 13 days ago • 18

SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

Paper • 2506.23046 • Published Jun 29 • 1

upvoted a paper 12 days ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published 13 days ago • 18

upvoted a paper 17 days ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published 17 days ago • 99

updated a dataset 23 days ago

Chuanyang-Jin/MMToM-QA

Updated 23 days ago • 87 • 4

upvoted a paper 24 days ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published 24 days ago • 105

upvoted a paper about 1 month ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

authored a paper 3 months ago

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Paper • 2506.21876 • Published Jun 27 • 28

liked a dataset 3 months ago

maitrix-org/WM-ABench

Viewer • Updated Aug 29 • 113k • 2.77k • 11

upvoted 2 papers 3 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Paper • 2506.21876 • Published Jun 27 • 28

New activity in SCAI-JHU/MUMA-TOM-BENCHMARK 4 months ago

Create README.md

#1 opened 4 months ago by Chuanyang-Jin

liked a dataset 4 months ago

SCAI-JHU/MUMA-TOM-BENCHMARK

Viewer • Updated Jun 25 • 973 • 314 • 3
