Q8_0 model weights less than Q2_0

by ivanstepanovftw - opened Apr 5

Discussion

ivanstepanovftw

Apr 5

Uploaded gemma-7b-it.Q8_0.gguf file have 3.41 GB size, which is less than any other quantized models.

brittlewis12

Owner Apr 21

@ivanstepanovftw thanks for the heads up!

I'm not sure how that happened to start with, but I've reconverted the q8_0 and it's uploading now, should be available within ~5 minutes. Sorry for any inconvenience!

I'm also converting the gemma 1.1 models soon, and adding imatrix quants to go with them!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment