Yongliang Shen

tricktreat

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

authored a paper 4 days ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

authored a paper 4 days ago

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Organizations

authored 3 papers 4 days ago

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published 5 days ago • 3

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published 4 days ago • 26

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published 4 days ago • 26

liked a dataset 4 days ago

ZJU-REAL/GSM8K-V

Viewer • Updated 2 days ago • 700 • 549 • 7

upvoted 2 papers 4 days ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published 4 days ago • 26

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published 4 days ago • 26

authored 2 papers 16 days ago

EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes

Paper • 2509.00877 • Published Aug 31 • 1

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published 19 days ago • 46

upvoted 2 papers 18 days ago

EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes

Paper • 2509.00877 • Published Aug 31 • 1

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published 19 days ago • 46

liked 2 models about 2 months ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26 • 293k • 794

Qwen/Qwen3-32B

Text Generation • 33B • Updated Jul 26 • 5.8M • 544

authored a paper about 2 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12 • 36

upvoted a paper about 2 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12 • 36

commented on a paper about 2 months ago

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7 • 19

authored 3 papers about 2 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7 • 17

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7 • 19

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7 • 21

upvoted 2 papers about 2 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7 • 17

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7 • 21
