Submitted by Junteng 71 WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents · 15 authors 3
Submitted by Lingaaaaaaa 47 Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models · 6 authors 134 5
Submitted by wenjun-li 26 Reinforcement Learning Foundations for Deep Research Systems: A Survey · 11 authors 15 2
Submitted by glecorve 24 DivMerge: A divergence-based model merging method for multi-tasking · 4 authors 2
Submitted by taesiri 19 Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents · 4 authors 36 3
Submitted by YuyaoGe 15 Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning · 9 authors 2
Submitted by dorni 14 UniVerse-1: Unified Audio-Video Generation via Stitching of Experts · 10 authors 50 2
Submitted by lioooox 10 Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? · 9 authors 13 2
Submitted by taesiri 8 Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers · 5 authors 2
Submitted by cxiong 8 SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents · 7 authors 2
Submitted by JamesXZ 5 Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet · 3 authors 4 2
Submitted by UVSKKR 5 D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning · 6 authors 6 2
Submitted by MElHuseyni 5 Guided Decoding and Its Critical Role in Retrieval-Augmented Generation · 7 authors 2
Submitted by stefan-it 4 Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian · 8 authors 2
Submitted by Youbang 3 R^textbf{2AI}: Towards Resistant and Resilient AI in an Evolving World · 5 authors 2
Submitted by LuJingyi 3 Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping · 2 authors 23 2
Submitted by bearhaon 3 Mechanistic interpretability for steering vision-language-action models · 4 authors 2
Submitted by lgy0404 2 MAS-Bench: A Unified Benchmark for Shortcut-Augmented Hybrid Mobile GUI Agents · 11 authors 8 2
Submitted by TahaKoleilat 2 Singular Value Few-shot Adaptation of Vision-Language Models · 3 authors 5 2
Submitted by sileod 1 Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem · 2 authors 5 2
Submitted by xchu123 1 DCReg: Decoupled Characterization for Efficient Degenerate LiDAR Registration · 6 authors 55 2