Qwen3-Quantization Collection This is the official quantized models collection of Qwen3 Quantization • 43 items • Updated May 12 • 6
Accurate LoRA-Finetuning Quantization of LLMs via Information Retention Paper • 2402.05445 • Published Feb 8, 2024 • 1
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis Paper • 2405.14224 • Published May 23, 2024 • 17
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22, 2024 • 46
LLaMA3-Quantization Collection This is the official quantized models collection of “How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study” • 9 items • Updated Apr 23, 2024 • 4
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs Paper • 2402.04291 • Published Feb 6, 2024 • 51