QuantFactory
/

gemma-2-27b-it-abliterated-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Edit model card

QuantFactory/gemma-2-27b-it-abliterated-GGUF

This is quantized version of byroneverson/gemma-2-27b-it-abliterated created using llama.cpp

Original Model Card

gemma-2-27b-it-abliterated

Now accepting abliteration requests. If you would like to see a model abliterated, follow me and leave me a message with model link.

This is a new approach for abliterating models using CPU only. I was able to abliterate this model using free kaggle processing with no accelerator.

Obtain refusal direction vector using a quant model with llama.cpp (llama-cpp-python and ggml-python).
Orthogonalize each .safetensors files directly from original repo and upload to a new repo. (one at a time)

Check out the jupyter notebook for details of how this model was abliterated from gemma-2-27b-it.

Downloads last month: 340

GGUF

Model size

27.2B params

Architecture

gemma2

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Examples

Text Generation

Inference API (serverless) is not available, repository is disabled.

Model tree for QuantFactory/gemma-2-27b-it-abliterated-GGUF

Base model

google/gemma-2-27b

Finetuned

google/gemma-2-27b-it

Quantized

this model