- Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report
  Paper • 2508.01059 • Published • 33
- Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
  Paper • 2508.01191 • Published • 234
- On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
  Paper • 2508.05629 • Published • 170
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
  Paper • 2508.06471 • Published • 173
Jason
songsh
AI & ML interests
None yet
Recent Activity
updated a collection 17 days ago: research-catchup
Organizations
None yet