Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published Mar 26 • 48
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 104
Tessa-T1 REACT REASONING MODEL Collection Tessa-T1 is a model that generates Stateful React with tailwind styling. It has features of other libraries as well. It is based on Qwen2.5-Coder. • 5 items • Updated Mar 24 • 7
SLM Judge Models Collection Base model(s) merged with the specific evaluation task adapter. Each model performs excellently for its purpose and remains useful for general tasks. • 6 items • Updated Feb 18 • 1
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 408
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15