43 149 645

Sayantan Das

ucalyptus

https://ucalyptus.me/

AI & ML interests

Generative Modeling

Recent Activity

updated a Space 1 day ago

ucalyptus/sglang-prefill-decoded-aggregation

published a Space 1 day ago

ucalyptus/sglang-prefill-decoded-aggregation

liked a model 8 days ago

microsoft/Phi-4-mini-reasoning

View all activity

Organizations

ucalyptus's activity

upvoted an article 13 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 624

upvoted a paper 16 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 48

upvoted a paper 26 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 104

upvoted a collection about 2 months ago

Tessa-T1 REACT REASONING MODEL

Collection

Tessa-T1 is a model that generates Stateful React with tailwind styling. It has features of other libraries as well. It is based on Qwen2.5-Coder. • 5 items • Updated Mar 24 • 7

upvoted 3 papers about 2 months ago

upvoted a collection about 2 months ago

SLM Judge Models

Collection

Base model(s) merged with the specific evaluation task adapter. Each model performs excellently for its purpose and remains useful for general tasks. • 6 items • Updated Feb 18 • 1

upvoted an article about 2 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 408

upvoted a paper about 2 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

upvoted a collection 2 months ago

NuExtract-1.5

Collection

4 items • Updated Nov 15, 2024 • 7

upvoted 3 articles 2 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

• 79

Article

How to deploy and fine-tune DeepSeek models on AWS

Jan 30

• 52

Article

The AI tools for Art Newsletter - Issue 1

Jan 31

• 77