FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis Paper • 2504.04842 • Published 22 days ago • 35
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published 25 days ago • 43
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published 26 days ago • 65
Articulated Kinematics Distillation from Video Diffusion Models Paper • 2504.01204 • Published 27 days ago • 24
SkyReels-A2: Compose Anything in Video Diffusion Transformers Paper • 2504.02436 • Published 26 days ago • 36
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published Jan 24 • 36
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Mar 13 • 302
Apple MLX-compatible 7B LLMs on the 🤗 Hub Collection This collection contains the model weights for 7B LLMs for Apple's MLX framework. Find more information at https://github.com/ml-explore/mlx • 8 items • Updated Sep 2, 2024 • 8
What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs Paper • 2401.02411 • Published Jan 4, 2024 • 14
Make Pixels Dance: High-Dynamic Video Generation Paper • 2311.10982 • Published Nov 18, 2023 • 69
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Paper • 2310.08491 • Published Oct 12, 2023 • 55