SpeechLMM v1 - a meetween Collection

meetween 's Collections

updated Jul 15

1st generation of SpeechLMM models, capable of ingesting video, audio and text and generate text as output. From the Meetween consortium (meetween.eu)

Upvote

meetween/Llama-speechlmm-1.0-s

Feature Extraction • 2B • Updated Aug 20 • 3
meetween/Llama-speechlmm-1.0-m

Feature Extraction • 4B • Updated Aug 20 • 4
meetween/Llama-speechlmm-1.0-l

Feature Extraction • 8B • Updated Aug 20 • 4
meetween/Llama-speechlmm-1.0-xl

Feature Extraction • 1B • Updated Mar 12 • 16
meetween/Llama-speechlmm-1.0-l-ASR

0.6B • Updated Jun 5 • 3
meetween/Llama-speechlmm-1.0-l-ST

Translation • 9B • Updated Apr 30 • 5
meetween/Llama-speechlmm-1.0-l-MT

Translation • 9B • Updated Jun 18 • 4
meetween/Llama-speechlmm-1.0-l-SLU

9B • Updated Jun 19 • 1
meetween/Llama-speechlmm-1.0-l-LIPREAD

Other • 9B • Updated May 23 • 17
meetween/Llama-speechlmm-1.0-l-SQA

Translation • 9B • Updated May 22 • 3
meetween/Llama-speechlmm-1.0-l-SSUM

9B • Updated Apr 22 • 1
meetween/Llama-speechlmm-1.0-l-TSUM

9B • Updated Aug 22 • 2

Upvote

Collection guide
Browse collections