Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Sunshine279
/
gammaPO-mistral-7b-instruct-v0.2
like
0
Safetensors
princeton-nlp/mistral-instruct-ultrafeedback
mistral
alignment-handbook
Generated from Trainer
arxiv:
2506.03690
License:
mit
Model card
Files
Files and versions
xet
Community
main
gammaPO-mistral-7b-instruct-v0.2
Commit History
Update README.md
00f1ddb
verified
Sunshine279
commited on
Jun 29
Update README.md
6af0728
verified
Sunshine279
commited on
Jun 29
Update README.md
ee33845
verified
Sunshine279
commited on
Jun 29
Update README.md
fff430e
verified
Sunshine279
commited on
Jun 29
Update README.md
2c30ff6
verified
Sunshine279
commited on
Jun 29
上传 model-00001-of-00003.safetensors
5f433c6
verified
Sunshine279
commited on
Jun 28
上传 trainer_state.json
3b84728
verified
Sunshine279
commited on
Jun 28
上传 model.safetensors.index.json
d031029
verified
Sunshine279
commited on
Jun 28
上传 all_results.json
b18e636
verified
Sunshine279
commited on
Jun 28
上传 eval_results.json
bfb9c72
verified
Sunshine279
commited on
Jun 28
上传 training_args.bin
838e894
verified
Sunshine279
commited on
Jun 28
上传 README.md
1aa0ee6
verified
Sunshine279
commited on
Jun 28
上传 tokenizer.model
fe0ff8f
verified
Sunshine279
commited on
Jun 28
上传 generation_config.json
391760c
verified
Sunshine279
commited on
Jun 28
上传 train_results.json
d21f69d
verified
Sunshine279
commited on
Jun 28
上传 tokenizer.json
9ec119e
verified
Sunshine279
commited on
Jun 28
上传 model-00003-of-00003.safetensors
12e6b96
verified
Sunshine279
commited on
Jun 28
上传 config.json
c677e68
verified
Sunshine279
commited on
Jun 28
上传 special_tokens_map.json
be30092
verified
Sunshine279
commited on
Jun 28
上传 tokenizer_config.json
998d6a6
verified
Sunshine279
commited on
Jun 28
上传 model-00002-of-00003.safetensors
4855190
verified
Sunshine279
commited on
Jun 28
initial commit
44a4cb1
verified
Sunshine279
commited on
Jun 28