Shiyu Huang

ShiyuHuang

https://huangshiyu13.github.io/

AI & ML interests

RL, Game AI, NLP, CV

Recent Activity

commented on a paper about 2 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

updated a collection 2 months ago

video_benchmark

ShiyuHuang's activity

commented on a paper about 2 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 27

updated a collection 2 months ago

video_benchmark

3 items • Updated Feb 27

upvoted a paper 2 months ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86

updated a collection 2 months ago

Reasoning

2 items • Updated Feb 27

New activity in THUDM/cogvlm2-llama3-caption 3 months ago

keep mentioning "bilibili" watermark

#6 opened 5 months ago by

How well does it perform on Chinese?

#1 opened 7 months ago by

authored a paper 4 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45

liked a dataset 4 months ago

THUDM/MotionBench

Viewer • Updated Jan 8 • 5k • 653 • 2

upvoted a paper 4 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45

authored a paper 4 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 19

liked a dataset 4 months ago

AIWinter/LVBench

Updated Sep 13, 2024 • 584 • 5

updated a Space 4 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

liked a model 4 months ago

THUDM/VisionReward-Video

Text Generation • Updated Jan 1 • 306 • 5

liked a Space 4 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

updated 3 Spaces 4 months ago

LVBench Leaderboard

Submit model evaluations to a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard