Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
PKU-Alignment
/
beaver-7b-v1.0
like
11
Follow
PKU-Alignment
131
Reinforcement Learning
Safetensors
PKU-Alignment/PKU-SafeRLHF
English
safe-rlhf
llama
reinforcement-learning-from-human-feedback
rlhf
safety
ai-safety
deepspeed
beaver
alpaca
arxiv:
2302.13971
arxiv:
2307.04657
arxiv:
2310.12773
Model card
Files
Files and versions
7c280b0
beaver-7b-v1.0
Ctrl+K
Ctrl+K
2 contributors
History:
8 commits
XuehaiPan
Update example usage
7c280b0
over 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 2 years ago
README.md
9.27 kB
Update example usage
over 1 year ago
added_tokens.json
Safe
21 Bytes
hello beaver-7b-v1.0
about 2 years ago
config.json
507 Bytes
hello beaver-7b-v1.0
about 2 years ago
generation_config.json
Safe
136 Bytes
hello beaver-7b-v1.0
about 2 years ago
pytorch_model-00001-of-00002.bin
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
9.98 GB
LFS
hello beaver-7b-v1.0
about 2 years ago
pytorch_model-00002-of-00002.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
3.5 GB
LFS
hello beaver-7b-v1.0
about 2 years ago
pytorch_model.bin.index.json
26.8 kB
hello beaver-7b-v1.0
about 2 years ago
special_tokens_map.json
Safe
435 Bytes
hello beaver-7b-v1.0
about 2 years ago
tokenizer.model
Safe
500 kB
LFS
hello beaver-7b-v1.0
about 2 years ago
tokenizer_config.json
Safe
725 Bytes
hello beaver-7b-v1.0
about 2 years ago