Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
BabyChou
/
Deepseek-R1-Distill-Qwen-1.5B-GRPO-Concise
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
Deepseek-R1-Distill-Qwen-1.5B-GRPO-Concise
/
checkpoint-150
/
rng_state_0.pth
Commit History
Upload folder using huggingface_hub
788ca6d
verified
BabyChou
commited on
Mar 13