EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation Paper • 2303.11089 • Published Mar 20, 2023
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Paper • 2412.14167 • Published Dec 18, 2024
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published Apr 11 • 41
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Paper • 2505.18096 • Published May 23
Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published Jul 10 • 33