Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
BBQGOD
/
DeepSeek-GRM-16B
like
1
Text Generation
Transformers
Safetensors
4 datasets
Chinese
English
deepseek_v2
conversational
custom_code
text-generation-inference
arxiv:
2504.02495
License:
deepseek
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
DeepSeek-GRM-16B
31.4 GB
1 contributor
History:
4 commits
BBQGOD
Update README.md
d5b327a
verified
about 1 month ago
.gitattributes
Safe
1.66 kB
Upload 11 files
about 1 month ago
LICENSE
13.8 kB
Upload 11 files
about 1 month ago
README.md
Safe
55.8 kB
Update README.md
about 1 month ago
approach.jpg
Safe
554 kB
xet
Upload 11 files
about 1 month ago
config.json
Safe
1.52 kB
Upload 11 files
about 1 month ago
configuration.json
Safe
40 Bytes
Upload 11 files
about 1 month ago
configuration_deepseek.py
Safe
10.3 kB
Upload 11 files
about 1 month ago
method.jpg
Safe
1.16 MB
xet
Upload 11 files
about 1 month ago
model-00001-of-000004.safetensors
Safe
8.59 GB
xet
Upload folder using huggingface_hub
about 1 month ago
model-00002-of-000004.safetensors
Safe
8.59 GB
xet
Upload folder using huggingface_hub
about 1 month ago
model-00003-of-000004.safetensors
Safe
8.59 GB
xet
Upload folder using huggingface_hub
about 1 month ago
model-00004-of-000004.safetensors
Safe
5.64 GB
xet
Upload folder using huggingface_hub
about 1 month ago
model.safetensors.index.json
480 kB
Upload 11 files
about 1 month ago
modeling_deepseek.py
Safe
78.7 kB
Upload 11 files
about 1 month ago
scaling.jpg
Safe
390 kB
xet
Upload 11 files
about 1 month ago
tokenizer.json
Safe
4.61 MB
Upload 11 files
about 1 month ago
tokenizer_config.json
Safe
2.94 kB
Upload 11 files
about 1 month ago