π Ichigo v0.5 Collection The experimental family designed to train LLMs to understand sound natively. β’ 2 items β’ Updated 15 days ago β’ 4
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant Paper β’ 2410.15316 β’ Published Oct 20, 2024 β’ 11
π Ichigo v0.4 Collection The experimental family designed to train LLMs to understand sound natively. β’ 3 items β’ Updated 15 days ago β’ 8
view article Article Fine-Tune a Semantic Segmentation Model with a Custom Dataset Mar 17, 2022 β’ 20
X2I Dataset Collection Datasets used in OmniGen-v1. (v2 is coming soon :) ) β’ 5 items β’ Updated 9 days ago β’ 16
Paper Trial - 2025 Collection 2025 Weekly Paper Reading Challenge: This is a journey of reading and sharing insightful research papers throughout 2025. β’ 11 items β’ Updated 8 days ago β’ 3
BitNet Collection π₯BitNet family of large language models (1-bit LLMs). β’ 7 items β’ Updated 7 days ago β’ 36
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). β’ 13 items β’ Updated 2 days ago β’ 17
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory β’ 15 items β’ Updated 19 days ago β’ 186
SVDQuant Collection Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" β’ 20 items β’ Updated Mar 17 β’ 34
distil-large-v3.5 Collection This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. β’ 5 items β’ Updated Mar 25 β’ 7