lly0571's Collections

Early Fusion MLLMs

updated Jan 10

  • Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

    Paper • 2410.13863 • Published Oct 17, 2024 • 38

  • BAAI/Emu3-Chat

    Text Generation • 8B • Updated Oct 24, 2024 • 49 • 73

  • deepseek-ai/Janus-1.3B

    Any-to-Any • 2B • Updated Jan 27 • 10.1k • 590

  • facebook/chameleon-7b

    Image-Text-to-Text • 7B • Updated Jul 23, 2024 • 73k • 190

  • showlab/show-o

    Any-to-Any • Updated Jun 21 • 301 • 16

  • Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

    Paper • 2412.18619 • Published Dec 16, 2024 • 58