Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
meetween 's Collections
Meetween's Research Papers
SpeechLMM v1

SpeechLMM v1

updated Jul 15

1st generation of SpeechLMM models, capable of ingesting video, audio and text and generate text as output. From the Meetween consortium (meetween.eu)

Upvote
-

  • meetween/Llama-speechlmm-1.0-s

    Feature Extraction • 2B • Updated Aug 20 • 3

  • meetween/Llama-speechlmm-1.0-m

    Feature Extraction • 4B • Updated Aug 20 • 4

  • meetween/Llama-speechlmm-1.0-l

    Feature Extraction • 8B • Updated Aug 20 • 4

  • meetween/Llama-speechlmm-1.0-xl

    Feature Extraction • 1B • Updated Mar 12 • 16

  • meetween/Llama-speechlmm-1.0-l-ASR

    0.6B • Updated Jun 5 • 3

  • meetween/Llama-speechlmm-1.0-l-ST

    Translation • 9B • Updated Apr 30 • 5

  • meetween/Llama-speechlmm-1.0-l-MT

    Translation • 9B • Updated Jun 18 • 4

  • meetween/Llama-speechlmm-1.0-l-SLU

    9B • Updated Jun 19 • 1

  • meetween/Llama-speechlmm-1.0-l-LIPREAD

    Other • 9B • Updated May 23 • 17

  • meetween/Llama-speechlmm-1.0-l-SQA

    Translation • 9B • Updated May 22 • 3

  • meetween/Llama-speechlmm-1.0-l-SSUM

    9B • Updated Apr 22 • 1

  • meetween/Llama-speechlmm-1.0-l-TSUM

    9B • Updated Aug 22 • 2
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs