File size: 885 Bytes
119d6de
 
 
 
 
 
32d1381
ac939d5
32d1381
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6f8d538
a3f432f
6f8d538
119d6de
6f8d538
119d6de
32d1381
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
---
license: other
license_name: flux-1-dev-non-commercial-license
license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
---

Main page: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/981

---

Update:

Always use V2 by default. 

V2 is quantized in a better way to turn off the second stage of double quant. 

V2 is 0.5 GB larger than the previous version, since the chunk 64 norm is now stored in full precision float32, making it much more precise than the previous version. Also, since V2 does not have second compression stage, it now has less computation overhead for on-the-fly decompression, making the inference a bit faster.

The only drawback of V2 is being 0.5 GB larger.

---

Main model in bnb-nf4 (v1 with chunk 64 norm in nf4, v2 with chunk 64 norm in float32)

T5xxl in fp8e4m3fn

CLIP-L in fp16

VAE in bf16