Submitted by benfielding 310 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing · 15 authors 1.35k 38
Submitted by Hanyuezhuohua 74 Parallel-R1: Towards Parallel Thinking via Reinforcement Learning · 11 authors 47 3
Submitted by taesiri 63 Visual Representation Alignment for Multimodal Large Language Models · 13 authors 72 7
Submitted by taesiri 49 Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search · 6 authors 169 2
Submitted by sanaka87 35 Reconstruction Alignment Improves Unified Multimodal Models · 4 authors 94 2
Submitted by fenfan 25 UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward · 6 authors 63 2
Submitted by aopolin-lv 23 F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions · 10 authors 55 2
Submitted by ChillingDream 17 Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding · 11 authors 5 2
Submitted by JasperHaozhe 17 Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning · 6 authors 9 2
Submitted by xianbao 11 Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference · 9 authors 3
Submitted by taesiri 7 SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge · 5 authors 2
Submitted by nfrumkin 6 Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling · 2 authors 9 2
Submitted by jfkback 3 Benchmarking Information Retrieval Models on Complex Retrieval Tasks · 2 authors 2
Submitted by PraneetNeuro 1 From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers · 5 authors 2