Submitted by taesiri 179 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency · 61 authors 9.04k 6
Submitted by taesiri 40 Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation · 9 authors 2
Submitted by Ironieser 26 MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs · 6 authors 5 3
Submitted by Kaiyue 26 T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation · 5 authors 29 2
Submitted by mbur 26 Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling · 12 authors 51 10
Submitted by BAOLONGZHANSHEN 22 Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning · 13 authors 2
Submitted by Wyattz23 15 PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs · 5 authors 105 3
Submitted by omidgh 8 MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment · 11 authors 3
Submitted by Hecheng0625 7 TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling · 6 authors 155 2
Submitted by taesiri 6 ST-Raptor: LLM-Powered Semi-Structured Table Question Answering · 9 authors 22 2
Submitted by taesiri 5 Neither Valid nor Reliable? Investigating the Use of LLMs as Judges · 4 authors 2
Submitted by RuijieZhu 5 MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting · 8 authors 28 2
Submitted by ControlNet 4 Explain Before You Answer: A Survey on Compositional Visual Reasoning · 13 authors 78 2
Submitted by stefan-it 1 German4All - A Dataset and Model for Readability-Controlled Paraphrasing in German · 6 authors 2 5
Submitted by tristan-deep 1 Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing · 3 authors 2 2
Submitted by dipta007 1 If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition · 2 authors 0 2
Submitted by stefanos50 - REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework · 2 authors 7 2