Update README.md
README.md CHANGED
@@ -17,7 +17,7 @@ pipeline_tag: text-generation
 DPO fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) (3.82B params)
 using the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) RLHF dataset.
 Training in French also improves the model in English, surpassing the performance of its base model.
-128K context length
+*The model supports a 128K context length.*
 
 ### OpenLLM Leaderboard
 
@@ -123,6 +123,8 @@ sequences = pipeline(
 print(sequences[0]['generated_text'])
 ```
 
+* **4-bit quantized version** available here: [jpacifico/Chocolatine-3B-Instruct-DPO-v1.2-Q4_K_M-GGUF](https://huggingface.co/jpacifico/Chocolatine-3B-Instruct-DPO-v1.2-Q4_K_M-GGUF)
+
 ### Limitations
 
 The Chocolatine model is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.