"We must sleep, but AI Never Sleeps!"
Prompt Template
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
Human: {prompt}
Assistant:
Simple Usage
from transformers import AutoTokenizer
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("yanolja/EEVE-Korean-Instruct-2.8B-v1.0", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("yanolja/EEVE-Korean-Instruct-2.8B-v1.0", trust_remote_code=True)
prompt_template = "A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.\nHuman: {prompt}\nAssistant:\n"
# Korean prompt: "Please recommend a diet menu. (A) Salad (B) Chicken (C) Pizza (D) Pasta"
text = '다이어트식 메뉴를 추천해주세요.\n\n(A) 샐러드\n(B) 치킨\n(C) 피자\n(D) 파스타'
model_inputs = tokenizer(prompt_template.format(prompt=text), return_tensors='pt')
outputs = model.generate(**model_inputs, max_new_tokens=256)
output_text = tokenizer.batch_decode(outputs, skip_special_tokens=True)[0]
print(output_text)
Example Output
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
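Human: 다이어트식 메뉴를 추천해주세요.
(A) 샐러드
(B) 치킨
(C) 피자
(D) 파스타
Assistant:
(A) 샐러드를 추천드립니다. 샐러드는 저칼로리이면서도 영양소가 풍부해 다이어트식으로 적합합니다. 다양한 채소와 단백질을 추가하여 균형 잡힌 식사를 만드실 수 있습니다.
(In English: the prompt asks for a diet-menu recommendation among (A) salad, (B) chicken, (C) pizza, and (D) pasta. The reply recommends (A) salad, noting that it is low in calories yet rich in nutrients, and that adding various vegetables and protein makes for a balanced meal.)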
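As the example output shows, the decoded text repeats the full prompt before the answer. A minimal sketch of trimming it down to the assistant's reply follows. The helper name `extract_reply` is hypothetical and not part of transformers, and the fallback on the "Assistant:" marker assumes the prompt template above is used unchanged.

```python
def extract_reply(decoded: str, prompt: str) -> str:
    """Return only the assistant's reply from a decoded generation.

    `decoded` is the full text from tokenizer.batch_decode(...)[0];
    `prompt` is the formatted prompt that was fed to the model.
    """
    # Common case: the decoded text starts with the exact prompt.
    if decoded.startswith(prompt):
        return decoded[len(prompt):].strip()

    # Fallback: decoding may not reproduce the prompt byte for byte,
    # so cut at the last "Assistant:" marker from the template instead.
    _, found, tail = decoded.rpartition("Assistant:")
    return tail.strip() if found else decoded.strip()
```

With the usage snippet above, this would be called as `extract_reply(output_text, prompt_template.format(prompt=text))`.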
About the Model
First of all, overwhelming gratitude to the 'yanolja/EEVE' model and team! This model is a fine-tuned version of crimsonjoo/Neversleep-3B-v0.1, which is a Korean vocabulary-extended version of microsoft/phi-2. Specifically, we applied Direct Preference Optimization (DPO) using Axolotl.
For more details, please refer to our technical report: Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models.
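For readers unfamiliar with DPO, the sketch below computes the standard per-example DPO loss from summed log-probabilities of a chosen and a rejected response. It illustrates the general objective only. It is not the Axolotl implementation, and the function name, argument names, and the `beta=0.1` default are assumptions made for this example rather than settings taken from this model's training run.

```python
import math


def dpo_loss(policy_chosen_logp: float,
             policy_rejected_logp: float,
             ref_chosen_logp: float,
             ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Per-example DPO loss: -log(sigmoid(beta * (chosen_gain - rejected_gain))).

    Each *_logp is the summed log-probability of a full response under
    the trainable policy or the frozen reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_gain = policy_chosen_logp - ref_chosen_logp
    rejected_gain = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_gain - rejected_gain)

    # -log(sigmoid(m)) == log(1 + exp(-m)); branch to avoid overflow.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference agree, the margin is zero and the loss equals log 2. The loss falls as the policy raises the chosen response's likelihood relative to the rejected one.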
Training Data
- Korean-translated version of Open-Orca/SlimOrca-Dedup
- Korean-translated version of argilla/ultrafeedback-binarized-preferences-cleaned
- No other dataset was used