-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 256 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 153 -
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper • 2507.07105 • Published • 104 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 91
Collections
Discover the best community collections!
Collections including paper arxiv:2507.09404
-
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
Paper • 2505.14231 • Published • 52 -
Skywork-R1V3 Technical Report
Paper • 2507.06167 • Published • 70 -
Scaling Laws for Optimal Data Mixtures
Paper • 2507.09404 • Published • 35 -
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Paper • 2507.10532 • Published • 88
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 34 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34
-
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
Paper • 2506.19290 • Published • 52 -
Data Efficacy for Language Model Training
Paper • 2506.21545 • Published • 11 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 47 -
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs
Paper • 2507.03253 • Published • 18
-
microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 5.28k • 1.19k -
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper • 2504.10449 • Published • 15 -
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Text Generation • 8B • Updated • 848 • 15 -
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 62
-
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 30 -
Baichuan Alignment Technical Report
Paper • 2410.14940 • Published • 51 -
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution
Paper • 2410.16256 • Published • 60 -
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Paper • 2410.18558 • Published • 19
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 256 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 153 -
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper • 2507.07105 • Published • 104 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 91
-
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
Paper • 2506.19290 • Published • 52 -
Data Efficacy for Language Model Training
Paper • 2506.21545 • Published • 11 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 47 -
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs
Paper • 2507.03253 • Published • 18
-
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
Paper • 2505.14231 • Published • 52 -
Skywork-R1V3 Technical Report
Paper • 2507.06167 • Published • 70 -
Scaling Laws for Optimal Data Mixtures
Paper • 2507.09404 • Published • 35 -
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Paper • 2507.10532 • Published • 88
-
microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 5.28k • 1.19k -
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper • 2504.10449 • Published • 15 -
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Text Generation • 8B • Updated • 848 • 15 -
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 62
-
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 30 -
Baichuan Alignment Technical Report
Paper • 2410.14940 • Published • 51 -
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution
Paper • 2410.16256 • Published • 60 -
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Paper • 2410.18558 • Published • 19
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 34 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34