damerajee committed on
Commit 97ddd2f
1 Parent(s): 136a57f

Update README.md

Files changed (1): README.md (+39 -5)
README.md CHANGED
@@ -8,19 +8,53 @@ pipeline_tag: text2text-generation
  tags:
  - translation
  - Bilingual
+ datasets:
+ - Aarif1430/english-to-hindi
+ - Sampuran01/english-hindi-translation
  ---
- # Model
+ # Model Description
+ The base model [sarvamai/OpenHathi-7B-Hi-v0.1-Base](https://huggingface.co/sarvamai/OpenHathi-7B-Hi-v0.1-Base) was fine-tuned using [Unsloth](https://github.com/unslothai/unsloth).

  <img src="https://cdn-uploads.huggingface.co/production/uploads/6487239cca30096ea9f52115/Rsixw_aSB-ytZT7VEQ06c.jpeg" width="500" height="500" alt="Image">

- # Steps to try the model
+ # Steps to try the model:

- # Load the model
+ ## Load the model
+ ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+
+ tokenizer = AutoTokenizer.from_pretrained("damerajee/openhathi-h2e-e2h")
+ model = AutoModelForCausalLM.from_pretrained("damerajee/openhathi-h2e-e2h")
+ ```
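The snippet above loads full-precision weights, which is heavy for a 7B model on a free T4. If GPU memory is limited, a 4-bit quantized load is one option; the following is a minimal sketch (not from the model card) that assumes the bitsandbytes package is installed:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Optional 4-bit load for small GPUs (assumes bitsandbytes is available).
bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)

tokenizer = AutoTokenizer.from_pretrained("damerajee/openhathi-h2e-e2h")
model = AutoModelForCausalLM.from_pretrained(
    "damerajee/openhathi-h2e-e2h",
    quantization_config=bnb_config,
    device_map="auto",
)
```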
- # Inference
+ ## Inference
+ ```python
+ inputs = tokenizer(["[INST]translate this from english to hindi: Be a free thinker and don't accept everything you hear as truth. Be critical and evaluate what you believe in. [/INST]<s> hindi output:"], return_tensors = "pt").to("cuda")
+
+ outputs = model.generate(**inputs, max_new_tokens = 18, use_cache = True)
+ tokenizer.batch_decode(outputs)
+ ```
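The repository name (h2e-e2h) indicates the model also translates Hindi to English, but the card does not show that prompt. A small sketch under the assumption that the same template is used with the direction and output tag swapped, decoding only the newly generated tokens:

```python
# Assumed Hindi-to-English prompt template (not confirmed by the model card).
prompt = "[INST]translate this from hindi to english: नमस्ते, आप कैसे हैं? [/INST]<s> english output:"

inputs = tokenizer([prompt], return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=32, use_cache=True)

# Decode only the tokens produced after the prompt.
generated = outputs[0][inputs["input_ids"].shape[1]:]
print(tokenizer.decode(generated, skip_special_tokens=True))
```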

  # Training details
+ * The model was loaded in 4-bit
+ * The target modules were "q_proj", "k_proj", "v_proj", "o_proj"
+ * Training took approximately 2 hours
+ * The fine-tuning was done in a free Google Colab session with a single T4 GPU (huge thanks to Unsloth for this); a rough setup sketch follows this list
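The fine-tuning script itself is not included in the card; the sketch below shows what a 4-bit QLoRA setup with Unsloth matching the bullets above could look like. The rank, alpha, and sequence length are illustrative assumptions, not values taken from the card:

```python
from unsloth import FastLanguageModel

# Load the base model in 4-bit, as described in the training details above.
# max_seq_length is an assumed value; the card does not state it.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="sarvamai/OpenHathi-7B-Hi-v0.1-Base",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters on the attention projections listed above.
# r and lora_alpha are illustrative; the card does not state them.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```

Training would then typically run through a standard supervised fine-tuning loop (for example trl's `SFTTrainer`) over the formatted translation prompts.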

- # Dataset
+ # Dataset
+ * The dataset used was a combination of two datasets, giving a total of 1,786,788 rows of English-Hindi parallel text
+ * The rows were then pre-processed to look something like this (a rough preprocessing sketch is shown after this section):
+
+ ```
+ [INST]translate this from english to hindi: When it is said to him: 'Fear Allah' egotism takes him in his sin. Gehenna (Hell) shall be enough for him. How evil a cradling! [/INST] hindi output: और जब उससे कहा जाता है, "अल्लाह से डर", तो अहंकार उसे और गुनाह पर जमा देता है। अतः उसके लिए तो जहन्नम ही काफ़ी है, और वह बहुत-ही बुरी शय्या है!
+ ```
+ * This was done for both English-to-Hindi and Hindi-to-English, hence the names h2e and e2h
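The preprocessing code is not part of the card; below is a rough sketch of how the two datasets listed in the metadata might be combined and rendered into that prompt shape. The column names ("english_sentence", "hindi_sentence") and the Hindi-to-English template are assumptions; check each dataset's actual schema before using this:

```python
from datasets import load_dataset, concatenate_datasets

# Both dataset names come from the card metadata; the split and column names are assumptions.
ds1 = load_dataset("Aarif1430/english-to-hindi", split="train")
ds2 = load_dataset("Sampuran01/english-hindi-translation", split="train")

def to_prompts(row):
    en, hi = row["english_sentence"], row["hindi_sentence"]  # assumed column names
    return {
        "e2h": f"[INST]translate this from english to hindi: {en} [/INST] hindi output: {hi}",
        "h2e": f"[INST]translate this from hindi to english: {hi} [/INST] english output: {en}",
    }

# Concatenation assumes the two datasets share the same schema; rename columns first if they differ.
combined = concatenate_datasets([ds1, ds2]).map(to_prompts)
```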