Jade's picture

Jade

euclaise

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

upvoted a paper about 8 hours ago

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

upvoted a paper about 8 hours ago

Practical Efficiency of Muon for Pretraining

View all activity

Organizations

euclaise's activity

upvoted 3 papers about 8 hours ago

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Paper • 2504.21659 • Published 9 days ago • 11

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

Paper • 2504.20605 • Published 10 days ago • 13

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published 4 days ago • 36

liked a model about 10 hours ago

JetBrains/Mellum-4b-base

Text Generation • Updated 2 days ago • 2.2k • 297

liked 2 datasets 1 day ago

nvidia/OpenCodeReasoning

Viewer • Updated 4 days ago • 753k • 18k • 369

nvidia/OpenMathReasoning

Viewer • Updated 15 days ago • 5.47M • 30.1k • 199

liked a model 1 day ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • Updated 9 days ago • 6.24k • • 739

liked a dataset 1 day ago

nvidia/Nemotron-CrossThink

Preview • Updated 8 days ago • 5.54k • 74

upvoted 2 papers 6 days ago

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published 10 days ago • 25

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published 8 days ago • 29

liked 3 datasets 19 days ago

Gryphe/Opus-WritingPrompts

Viewer • Updated Jan 9 • 6.02k • 181 • 63

GeneralReasoning/GeneralThought-430K

Viewer • Updated Mar 14 • 431k • 16.9k • 39

davanstrien/fine-reasoning-questions

Viewer • Updated 24 days ago • 244 • 752 • 16

upvoted 4 papers 21 days ago

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published 25 days ago • 10

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published 24 days ago • 16

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Paper • 2504.11393 • Published 24 days ago • 17

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 25 days ago • 84

liked a dataset 28 days ago

trl-lib/tldr-preference

Viewer • Updated Jan 8 • 179k • 584 • 2

upvoted 2 papers about 1 month ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 25

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1 • 13