PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning

reinforcement-learning-from-human-feedback

2 contributors

History: 8 commits

Latest commit: Update example usage (7c280b0, XuehaiPan, over 1 year ago)

File                              Size       Storage  Last commit           Updated
.gitattributes                    1.52 kB    -        initial commit        about 2 years ago
README.md                         9.27 kB    -        Update example usage  over 1 year ago
added_tokens.json                 21 Bytes   -        hello beaver-7b-v1.0  about 2 years ago
config.json                       507 Bytes  -        hello beaver-7b-v1.0  about 2 years ago
generation_config.json            136 Bytes  -        hello beaver-7b-v1.0  about 2 years ago
pytorch_model-00001-of-00002.bin  9.98 GB    LFS      hello beaver-7b-v1.0  about 2 years ago
pytorch_model-00002-of-00002.bin  3.5 GB     LFS      hello beaver-7b-v1.0  about 2 years ago
pytorch_model.bin.index.json      26.8 kB    -        hello beaver-7b-v1.0  about 2 years ago
special_tokens_map.json           435 Bytes  -        hello beaver-7b-v1.0  about 2 years ago
tokenizer.model                   500 kB     LFS      hello beaver-7b-v1.0  about 2 years ago
tokenizer_config.json             725 Bytes  -        hello beaver-7b-v1.0  about 2 years ago

Detected pickle imports (3) in each of the two pytorch_model-*.bin shards:
- "torch.BFloat16Storage"
- "collections.OrderedDict"
- "torch._utils._rebuild_tensor_v2"