| Name | Quant method | Size |
| --- | --- | --- |
| Ae-calem-mistral-7b-v0.2_f16.gguf | fp16 | 14.6 GB |
| Ae-calem-mistral-7b-v0.2_8bit.gguf | q8_0 | 7.75 GB |
Format: GGUF
Model size: 7.29B params
Architecture: llama
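
As a rough sanity check, the file sizes listed above can be reproduced from the parameter count and the bits stored per weight. The sketch below is only an estimate: it assumes every tensor is stored in the named format and ignores GGUF metadata, so a real file can differ slightly. The helper name `estimate_size_gb` is hypothetical. The 8.5 bits per weight for q8_0 follows from its block layout of 32 int8 weights plus one fp16 scale.

```python
# Rough GGUF file-size estimate from parameter count and bits per weight.
# Assumption: all tensors use the named format; file metadata is ignored.

N_PARAMS = 7.29e9  # "Model size" reported on this card

# Bits per weight. q8_0 stores blocks of 32 int8 weights plus one fp16 scale,
# i.e. 34 bytes per 32 weights = 8.5 bits per weight.
BITS_PER_WEIGHT = {
    "fp16": 16.0,
    "q8_0": 34 * 8 / 32,
}


def estimate_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Return the estimated size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


for name, bits in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{estimate_size_gb(N_PARAMS, bits):.2f} GB")
```

Both estimates land within rounding of the listed 14.6 GB and 7.75 GB, which suggests the sizes on this card are given in decimal gigabytes.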

