Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published Jun 4, 2024 • 37
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation Paper • 2502.03930 • Published Feb 6
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 18 days ago • 122
KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke Paper • 2110.09121 • Published Oct 18, 2021