Kooten commited on
Commit
ad83c41
1 Parent(s): 97f77ce

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - not-for-all-audiences
5
+ - nsfw
6
+ ---
7
+ ## Description
8
+
9
+ Exllama 2 quant of [NeverSleep/Echidna-13b-v0.3](https://huggingface.co/NeverSleep/Echidna-13b-v0.3)
10
+
11
+ 8 BPW, Head bit set to 8
12
+
13
+ ## VRAM
14
+ My VRAM usage with 13B models are:
15
+ | Bits per weight | Context | VRAM |
16
+ |--|--|--|
17
+ | 8bpw | 8k | 22gb |
18
+ | 8bpw | 4k | 19gb |
19
+ | 6bpw | 8k | 19gb |
20
+ | 6bpw | 4k | 16gb |
21
+ | 4bpw | 8k | 16gb |
22
+ | 4bpw | 4k | 13gb |
23
+ | 3bpw | 8k | 15gb |
24
+ | 3bpw | 4k | 12gb |
25
+ I have rounded up, these arent exact numbers, this is also on a windows machine, they should be slightly lower on linux.
26
+
27
+ ## Prompt template: Alpaca
28
+
29
+ ```
30
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
31
+
32
+ ### Instruction:
33
+ {prompt}
34
+
35
+ ### Response:
36
+
37
+ ```