V-GameGym: Visual Game Generation for Code Large Language Models Paper • 2509.20136 • Published 2 days ago • 8 • 1
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables Paper • 2508.19813 • Published about 1 month ago • 25 • 4
Evaluating and Aligning CodeLLMs on Human Preference Paper • 2412.05210 • Published Dec 6, 2024 • 50 • 2