Submitted by lxucs 73 ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning · 8 authors 273 2
Submitted by weigao266 53 Speed Always Wins: A Survey on Efficient Architectures for Large Language Models · 15 authors 339 2
Submitted by taesiri 45 S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models · 9 authors 140 2
Submitted by myyycroft 39 When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs · 6 authors 3 2
Submitted by caizhongang 33 Has GPT-5 Achieved Spatial Intelligence? An Empirical Study · 22 authors 2
Submitted by taesiri 25 Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model · 19 authors 1.66k 2
Submitted by petranokhin 25 HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds · 6 authors 12 2
Submitted by klemenk 17 Representing Speech Through Autoregressive Prediction of Cochlear Tokens · 4 authors 2
Submitted by taesiri 14 Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models · 9 authors 44 3
Submitted by rusrakhimov 12 G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration · 5 authors 2
Submitted by walsvid 11 Precise Action-to-Video Generation Through Visual Action Prompts · 8 authors 2
Submitted by xuhuizhan5 8 Inverse-LLaVA: Eliminating Alignment Pre-training Through Text-to-Vision Mapping · 2 authors 7 2
Submitted by jaeunglee 5 Unlearning Comparator: A Visual Analytics System for Comparative Evaluation of Machine Unlearning Methods · 5 authors 72 2
Submitted by YouchengHuang 3 Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information · 6 authors 3
Submitted by j-min 1 RotBench: Evaluating Multimodal Large Language Models on Identifying Image Rotation · 4 authors 5 2