Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mogabr11 's Collections
Data Distillation
Lora and Quantization
search intent convo generation
transformer inference improvement

Lora and Quantization

updated Dec 18, 2023
Upvote
-

  • LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

    Paper • 2310.08659 • Published Oct 12, 2023 • 28

  • QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

    Paper • 2309.14717 • Published Sep 26, 2023 • 44

  • BitNet: Scaling 1-bit Transformers for Large Language Models

    Paper • 2310.11453 • Published Oct 17, 2023 • 102

  • ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

    Paper • 2312.08583 • Published Dec 14, 2023 • 12

  • SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

    Paper • 2312.07987 • Published Dec 13, 2023 • 41
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs