Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ReactiveAI 's Collections
Sparse Query Attention (SQA) Research
Interaction SFT Datasets

Sparse Query Attention (SQA) Research

updated 3 days ago

Experimental models with Sparse Query Attention layers. Reducing training time/cost by ~3-10% compared to GQA & MQA, with the same level performance

Upvote
1

  • Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction

    Paper • 2510.01817 • Published 4 days ago • 11

  • ReactiveAI/sSQAT-mm

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/SQAT-mm

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/xSQAT-mm

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/SQAT-m

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/sSQAT-m

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/xSQAT-m

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/xSMQAT-m

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/GQA-Ref-Micro

    Text Generation • 0.0B • Updated 3 days ago

  • ReactiveAI/MQA-Ref-Micro

    Text Generation • 0.0B • Updated 3 days ago
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs