Generate and convert speech using text and audio inputs
Generate high-quality videos from text prompts and images