Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jbjeong91
/
llama3.1-cpo-full-0919
like
0
Text Generation
Transformers
TensorBoard
Safetensors
princeton-nlp/llama3-ultrafeedback
llama
alignment-handbook
trl
cpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
llama3.1
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
ecf66fe
llama3.1-cpo-full-0919
Commit History
Model save
ecf66fe
verified
jbjeong91
commited on
1 day ago
Training in progress, step 43
e9f63ab
verified
jbjeong91
commited on
1 day ago
Training in progress, step 40
eb085a0
verified
jbjeong91
commited on
1 day ago
Training in progress, step 30
b4bb70d
verified
jbjeong91
commited on
1 day ago
Training in progress, step 20
e16690e
verified
jbjeong91
commited on
1 day ago
Training in progress, step 10
0d09143
verified
jbjeong91
commited on
1 day ago
Training in progress, step 40
1f32534
verified
jbjeong91
commited on
1 day ago
Training in progress, step 30
793e06f
verified
jbjeong91
commited on
1 day ago
Training in progress, step 20
ab745cf
verified
jbjeong91
commited on
1 day ago
Training in progress, step 10
60abad8
verified
jbjeong91
commited on
1 day ago
initial commit
5f0352b
verified
jbjeong91
commited on
1 day ago