Yang Jian

CSJianYang

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

V-GameGym: Visual Game Generation for Code Large Language Models

commented on a paper 12 days ago

V-GameGym: Visual Game Generation for Code Large Language Models

commented on a paper 27 days ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Organizations

upvoted a paper 12 days ago

V-GameGym: Visual Game Generation for Code Large Language Models

Paper • 2509.20136 • Published 14 days ago • 9

commented on a paper 12 days ago

V-GameGym: Visual Game Generation for Code Large Language Models

Paper • 2509.20136 • Published 14 days ago • 9

commented on 2 papers 27 days ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

upvoted a paper about 1 month ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

commented on a paper about 1 month ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

liked 2 datasets about 1 month ago

Multilingual-Multimodal-NLP/IfEvalCode-Instruct

Viewer • Updated Aug 26 • 2.97k • 20 • 1

Multilingual-Multimodal-NLP/IfEvalCode-testset

Viewer • Updated Aug 26 • 810 • 58 • 1

upvoted a paper about 2 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

upvoted a paper 4 months ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 31

upvoted a paper 9 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 52

updated a dataset 10 months ago

CSJianYang/ExecRepoBench

Viewer • Updated Dec 23, 2024 • 1.16k • 30 • 3

upvoted a paper 10 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

liked a dataset 10 months ago

CSJianYang/ExecRepoBench

Viewer • Updated Dec 23, 2024 • 1.16k • 30 • 3

updated a dataset 10 months ago

CSJianYang/CodeArena

Viewer • Updated Dec 18, 2024 • 397 • 127 • 14

upvoted a paper 10 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 50

commented on a paper 10 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 50

upvoted a paper 10 months ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25, 2024 • 23

updated a model 10 months ago

Multilingual-Multimodal-NLP/SEVENLLM-Qwen1.5-14B

Text Generation • 14B • Updated Dec 1, 2024 • 4 • 1

liked a model 10 months ago

Multilingual-Multimodal-NLP/SEVENLLM-Llama-13B-CoT

Updated Dec 1, 2024 • 1

