microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 7 days ago • 286k • 1.37k
Running 552 552 Talking Face Generation with Multilingual TTS 👄 Generate a talking face video from text