view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 87
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 7 days ago • 137
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 29 items • Updated 7 days ago • 84
OpenMathReasoning Collection Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 2 days ago • 35
Jack of all Trades Models Collection Home of the Personality Engine series, models that can be molded to fit any task or purpose. • 3 items • Updated Feb 19 • 2
Llama Nemotron Collection Open, Production-ready Enterprise Models • 5 items • Updated 2 days ago • 50
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 27 days ago • 77
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 7 days ago • 80
DolphinLabeled Datasets Collection Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 15
FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 14 items • Updated Mar 7 • 13
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated Mar 17 • 112
Celeste Collection Series of roleplay specialist models with vibrant, diverse and human-like prose • 6 items • Updated Aug 8, 2024 • 8