Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2507.09404

July 2025 - Top Papers

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 256
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 153
4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 104
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 91

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20 • 52
Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8 • 70
Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 29
Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 48
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18, 2024 • 34
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 39
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20, 2024 • 13
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 34

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24 • 52
Data Efficacy for Language Model Training

Paper • 2506.21545 • Published Jun 26 • 11
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 47
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Paper • 2507.03253 • Published Jul 4 • 18

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 5.28k • 1.19k
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Text Generation • 8B • Updated Apr 17 • 848 • 15
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 62

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 30
Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 51
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 60
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

July 2025 - Top Papers

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 256
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 153
4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 104
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 91

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24 • 52
Data Efficacy for Language Model Training

Paper • 2506.21545 • Published Jun 26 • 11
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 47
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Paper • 2507.03253 • Published Jul 4 • 18

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20 • 52
Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8 • 70
Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 5.28k • 1.19k
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Text Generation • 8B • Updated Apr 17 • 848 • 15
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 62

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 29
Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 48
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 30
Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 51
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 60
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18, 2024 • 34
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 39
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20, 2024 • 13
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 34

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs