Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gaunernst 's Collections
DeepSeek testing
Gemma 3 QAT INT4 (from GGUF)
Gemma 3 QAT INT4 (from Flax)
Mini BERT models
Face Recognition Models
LLMs < 1B
LLMs 1B - 2B
LLMs 2B - 4B
Smallish LLM pre-training datasets
Llama2-compatible
Llama3-compatible

LLMs < 1B

updated Sep 29, 2024
Upvote
-

  • Qwen/Qwen2-0.5B

    Text Generation • 0.5B • Updated Oct 22, 2024 • 323k • 154

  • Qwen/Qwen2-0.5B-Instruct

    Text Generation • 0.5B • Updated Aug 21, 2024 • 258k • 197

  • HuggingFaceTB/SmolLM-135M

    Text Generation • 0.1B • Updated Aug 1, 2024 • 287k • 228

  • HuggingFaceTB/SmolLM-135M-Instruct

    Text Generation • 0.1B • Updated Sep 4, 2024 • 55.9k • 123

  • HuggingFaceTB/SmolLM-360M

    Text Generation • 0.4B • Updated Aug 1, 2024 • 74.3k • 67

  • HuggingFaceTB/SmolLM-360M-Instruct

    Text Generation • 0.4B • Updated Aug 18, 2024 • 28.9k • 83

  • apple/OpenELM-270M

    Text Generation • 0.3B • Updated Feb 28 • 456 • 75

  • apple/OpenELM-270M-Instruct

    Text Generation • 0.3B • Updated Feb 28 • 1.33k • 140

  • apple/OpenELM-450M

    Text Generation • 0.5B • Updated Feb 28 • 412 • 26

  • apple/OpenELM-450M-Instruct

    Text Generation • 0.5B • Updated Feb 28 • 892 • 48

  • facebook/opt-125m

    Text Generation • Updated Sep 15, 2023 • 7.08M • 219

  • facebook/opt-350m

    Text Generation • Updated Sep 15, 2023 • 155k • 148

  • amd/AMD-Llama-135m

    Text Generation • 0.1B • Updated Oct 9, 2024 • 8.2k • 116
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs