Rolling Forcing: Autoregressive Long Video Diffusion in Real Time Paper • 2509.25161 • Published 7 days ago • 21
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 8 days ago • 97
GenCompositor: Generative Video Compositing with Diffusion Transformer Paper • 2509.02460 • Published Sep 2 • 25
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published Aug 14 • 52
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts Paper • 2507.20939 • Published Jul 28 • 56
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture Paper • 2507.05163 • Published Jul 7 • 41
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation Paper • 2505.05422 • Published May 8 • 8
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios Paper • 2505.03730 • Published May 6 • 28
Cobra: Efficient Line Art COlorization with BRoAder References Paper • 2504.12240 • Published Apr 16 • 27
EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling Paper • 2504.02402 • Published Apr 3 • 5
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing Paper • 2503.13434 • Published Mar 17 • 27
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control Paper • 2503.05639 • Published Mar 7 • 24
ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper • 2412.11815 • Published Dec 16, 2024 • 26
BrushEdit: All-In-One Image Inpainting and Editing Paper • 2412.10316 • Published Dec 13, 2024 • 35
NVComposer Collection Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images • 4 items • Updated Dec 6, 2024 • 2
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Paper • 2412.03517 • Published Dec 4, 2024 • 19