Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs 9 days ago • 22
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation Paper • 2502.10341 • Published Feb 14 • 2
Aioli: A Unified Optimization Framework for Language Model Data Mixing Paper • 2411.05735 • Published Nov 8, 2024 • 1
Cell2Sentence Models Collection Cell2Sentence models trained for single-cell tasks • 5 items • Updated 21 days ago • 7
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published Mar 10 • 61
🏜️MIRAGE-Bench [NAACL'25] Collection Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated Mar 31 • 2
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning Paper • 2504.11456 • Published 22 days ago • 12
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 7 days ago • 13
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 3 items • Updated about 21 hours ago • 8
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 2 days ago • 17