kilian303's Collections
test

updated 21 days ago

  • Running
    345

    Qwen2.5 Omni 7B Demo

    🏆

    Generate text and speech from audio, video, and text inputs


  • Running on Zero
    2.59k

    F5-TTS

    🗣

    F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)


  • Runtime error
    309

    Kokoro TTS Zero

    🎴

    ✨[With v1.0.0] Accelerated TTS on Kokoro-82M


  • fixie-ai/ultravox-v0_5-llama-3_2-1b

    Audio-Text-to-Text • 0.7B • Updated May 6 • 376k • 56

  • Running on Zero
    830

    Sesame CSM

    🌱

    Conversational speech generation


  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10 • 2.85M • 5.01k

  • OuteAI/Llama-OuteTTS-1.0-1B

    Text-to-Speech • 1B • Updated about 16 hours ago • 114k • 205

  • suno/bark

    Text-to-Speech • Updated Oct 4, 2023 • 23.9k • 1.41k

  • senstella/csm-expressiva-1b

    Text-to-Speech • Updated Apr 17 • 16 • 33

  • Running

    Sesame AI POC

    ⚡

    Full working POC demonstrating text to speech and speech


  • Running
    37

    Spark-TTS

    ⚡

    (Unofficial) Gradio demo for Spark-TTS


  • Sleeping
    2

    VoiceBloom

    ✨

    Generate audio from text with customizable voice and speed


  • Running on Zero
    821

    TripoSG

    🔮

    Generate 3D models from images


  • Running on Zero
    4.97k

    FLUX.1 [Schnell]

    🏎

    Generate images from text prompts


  • MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

    Paper • 2508.01242 • Published Aug 2 • 9