Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning Paper • 2509.13755 • Published 6 days ago • 18
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs Paper • 2509.09677 • Published 12 days ago • 33
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22 • 148
MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published Aug 9 • 21
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published Aug 5 • 26
Janus Collection Janus is a novel autoregressive framework that unifies multimodal understanding and generation. • 8 items • Updated Feb 18 • 16
view article Article Introducing the Chatbot Guardrails Arena By sonalipnaik and 3 others • Mar 21, 2024 • 6
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 100
Running 3.24k 3.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published Feb 3 • 114
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 221
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 297
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 284