CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published 24 days ago • 28
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28 • 40 • 14
NousResearch/DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos Text Generation • 8B • Updated Apr 28 • 210 • 14
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 42 • 3
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 93 • 6
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 101 • 7
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent Paper • 2509.15566 • Published 17 days ago • 10
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper • 2509.21268 • Published 10 days ago • 98