Model Card for llama2_DPO_test_v1, trained with the Hugging Face TRL DPOTrainer

Downloads last month: 3,268
Safetensors
Model size: 13.2B params
Tensor type: BF16