Submitted by Chevalier 256 A Survey of Context Engineering for Large Language Models · 15 authors 2.33k 12
Submitted by Senqiao 74 VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning · 6 authors 394 4
Submitted by tonghe90 64 π^3: Scalable Permutation-Equivariant Visual Geometry Learning · 10 authors 1.24k 1
Submitted by krahets 56 Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models · 9 authors 505 2
Submitted by vanilla1116 48 The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner · 7 authors 3
Submitted by Ruihang 41 AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning · 11 authors 50 1
Submitted by ai-alanov 36 RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization · 7 authors 1
Submitted by yyuncong 26 MindJourney: Test-Time Scaling with World Models for Spatial Reasoning · 8 authors 85 1
Submitted by wangqiang9 24 FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers · 6 authors 466 1
Submitted by yilunzhao 19 AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research · 8 authors 4 1
Submitted by AndreiB137 8 Einstein Fields: A Neural Perspective To Computational General Relativity · 4 authors 60 1
Submitted by ucfzl 5 TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation · 2 authors 20 1