MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round3-checkpoint-epoch-20

Reinforcement Learning

text-generation

text-generation-inference

llama-3-2-1b-detox_v1f_SCALE8_round3-checkpoint-epoch-20 / rlhf_config.yaml

Commit History

Checkpoint epoch 20

4fb1e7e
verified

MattBou00 committed 20 days ago