Transformers
GGUF
English
Russian
Not-For-All-Audiences
nsfw
LakoMoor committed on
Commit
6b70508
1 Parent(s): bb8ceea

Delete .ipynb_checkpoints

.ipynb_checkpoints/README-checkpoint.md DELETED
@@ -1,57 +0,0 @@
- ---
- language:
- - en
- - ru
- tags:
- - not-for-all-audiences
- - nsfw
- base_model:
- - LakoMoor/Silicon-Alice-7B
- license: cc-by-nc-4.0
- inference: false
- library_name: transformers
- model_creator: LakoMoor
- model_name: Silicon-Alice-7B
- model_type: mistral
- ---
-
- # Silicon-Alice-7B-GGUF
- ![Silicon-Alice-7B](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/resolve/main/assets/alice.png)
-
- ## What's that?
-
- Silicon-Alice-7B-GGUF is a quantized version of [Silicon-Alice-7B](https://huggingface.co/LakoMoor/Silicon-Alice-7B), a model that aims to be strong in RP, smart, **and** able to understand Russian, while following character cards very well. This model understands Russian better than the previous one. It is suitable for RP/ERP and general use. It can run on low-end hardware using [llama.cpp](https://github.com/ggerganov/llama.cpp) or [koboldcpp](https://github.com/LostRuins/koboldcpp).
-
- ## Prompt Template (Alpaca)
-
- I got the best SillyTavern results with the Noromaid template, but please try other templates! Let me know if you find anything good.
-
- SillyTavern config files: [Context](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/resolve/main/assets/context.json), [Instruct](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/resolve/main/assets/instruct.json).
-
- Additionally, here is my highly recommended [Text Completion preset](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/resolve/main/assets/MinP.json). To boost creativity, raise the temperature or lower min p; to increase stability, raise min p. You shouldn't need to touch anything else!
-
- ```
- Below is an instruction that describes a task. Write a response that appropriately completes the request.
- ### Instruction:
- {prompt}
- ### Response:
- ```
-
- ## Provided files
-
- | Name | Quant method | Bits | Use case |
- | ---- | ---- | ---- | ----- |
- | [silicon-alice-7b.Q2_K.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q2_K.gguf) | Q2_K | 2 | smallest, significant quality loss - not recommended for most purposes |
- | [silicon-alice-7b.Q3_K_M.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q3_K_M.gguf) | Q3_K_M | 3 | very small, high quality loss |
- | [silicon-alice-7b.Q4_0.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q4_0.gguf) | Q4_0 | 4 | legacy; small, very high quality loss - prefer using Q3_K_M |
- | [silicon-alice-7b.Q4_K_M.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q4_K_M.gguf) | Q4_K_M | 4 | medium, balanced quality - recommended |
- | [silicon-alice-7b.Q5_0.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q5_0.gguf) | Q5_0 | 5 | legacy; medium, balanced quality - prefer using Q4_K_M |
- | [silicon-alice-7b.Q5_K_M.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q5_K_M.gguf) | Q5_K_M | 5 | large, very low quality loss - recommended |
- | [silicon-alice-7b.Q6_K.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q6_K.gguf) | Q6_K | 6 | very large, extremely low quality loss |
- | [silicon-alice-7b.Q8_0.gguf](https://huggingface.co/LakoMoor/Silicon-Alice-7B-GGUF/blob/main/silicon-alice-7b.Q8_0.gguf) | Q8_0 | 8 | very large, extremely low quality loss - not recommended |
-
- ## How to run it?
- With [llama.cpp](https://github.com/ggerganov/llama.cpp):
- ```shell
- ./main -ngl 35 -m silicon-alice-7b.Q4_K_M.gguf --color -c 32768 --temp 0.4 --repeat_penalty 1.1 -n -1 -p "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n### Instruction:\n{prompt}\n### Response:\n"
- ```
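
Note on the removed README (not part of the diff): the Alpaca template and the llama.cpp command it documented can be combined in a small helper. The following is a minimal Python sketch under stated assumptions, not the author's tooling. The function names `build_alpaca_prompt` and `build_llama_cpp_args` are hypothetical, the flags are copied from the README command as written, and the `./main` binary path is assumed to match the local llama.cpp build. The sketch only assembles the argument list; it does not launch the model.

```python
import shlex

# Header line taken verbatim from the Alpaca template in the README.
ALPACA_HEADER = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)


def build_alpaca_prompt(instruction: str) -> str:
    """Fill the README's Alpaca template with a user instruction."""
    return f"{ALPACA_HEADER}\n### Instruction:\n{instruction}\n### Response:\n"


def build_llama_cpp_args(model_path: str, instruction: str, binary: str = "./main") -> list:
    """Return an argv list mirroring the README command (hypothetical helper).

    The prompt carries real newline characters, so it does not rely on the
    CLI interpreting "\\n" escape sequences.
    """
    return [
        binary,
        "-ngl", "35",               # GPU layers to offload, as in the README
        "-m", model_path,
        "--color",
        "-c", "32768",              # context size from the README command
        "--temp", "0.4",
        "--repeat_penalty", "1.1",
        "-n", "-1",
        "-p", build_alpaca_prompt(instruction),
    ]


if __name__ == "__main__":
    args = build_llama_cpp_args("silicon-alice-7b.Q4_K_M.gguf", "Say hello in Russian.")
    # Print a shell-quoted command line for inspection; nothing is executed.
    print(shlex.join(args))
```

Passing the list to `subprocess.run(args)` would start llama.cpp, assuming the binary and the GGUF file exist at those paths. Building an argument list instead of a single shell string avoids quoting problems when the instruction itself contains quotes or newlines.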