Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published 5 days ago • 122
Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning Paper • 2506.09501 • Published Jun 11 • 18
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning Paper • 2509.22576 • Published 8 days ago • 124
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Paper • 2312.16256 • Published Dec 26, 2023 • 18
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See Paper • 2410.06169 • Published Oct 8, 2024
Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation Paper • 2412.10292 • Published Dec 13, 2024