Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OliP 's Collections
NewGen small LMs
Leading Leaderboards
2024 Papers of the year
2023 (and before) Papers of the Year
LLM Deployment
Vision-Language
Long-Context
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
Applications
Coding

Audio

updated Dec 19, 2024
Upvote
-

  • Stable Audio Open

    Paper • 2407.14358 • Published Jul 19, 2024 • 26

  • Qwen2-Audio Technical Report

    Paper • 2407.10759 • Published Jul 15, 2024 • 60

  • kyutai/moshiko-pytorch-bf16

    Updated Sep 18, 2024 • 181k • 186

  • Presto! Distilling Steps and Layers for Accelerating Music Generation

    Paper • 2410.05167 • Published Oct 7, 2024 • 18

  • OuteAI/OuteTTS-0.1-350M

    Text-to-Speech • 0.4B • Updated Apr 17 • 1.21k • 302

  • Foundation Models for Music: A Survey

    Paper • 2408.14340 • Published Aug 26, 2024 • 44

  • fishaudio/fish-speech-1.5

    Text-to-Speech • Updated Mar 25 • 2.11k • 627
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs