Choose your path
Three ways to get started — pick the one that fits your setup.HuggingFace Spaces
Zero install. Runs on ZeroGPU with a free daily quota. Open the Space and click Obliterate.
Google Colab
Free T4 GPU for models up to ~8B parameters. Open the notebook and hit Run All.
Local install
Full control on your own hardware. Follow the steps below.
Local quickstart
Install OBLITERATUS
Install from PyPI or directly from the repository.
Python 3.10 or higher is required. See Installation for platform-specific instructions, Docker usage, and hardware requirements.
Obliterate your first model
Run the OBLITERATUS prints a live progress display tracking all six pipeline stages as they run. The obliterated model is saved to
obliterate command with a model from HuggingFace Hub. The advanced method is the default and the recommended starting point for most models.abliterated/meta-llama_Llama-3.1-8B-Instruct/ by default.To save to a custom location:Understand the six pipeline stages
Every obliteration run — regardless of method — passes through six stages:
The
| Stage | What happens |
|---|---|
| SUMMON | Load the model and tokenizer from HuggingFace Hub or a local path |
| PROBE | Collect hidden-state activations on a set of restricted and unrestricted prompts |
| DISTILL | Extract refusal directions from the activation contrast via SVD decomposition |
| EXCISE | Project the refusal subspace out of the model’s weight matrices (norm-preserving) |
| VERIFY | Run perplexity, coherence, refusal rate, and KL divergence checks to confirm capabilities are intact |
| REBIRTH | Save the liberated model weights and a abliteration_metadata.json file |
informed method adds an ANALYZE stage between PROBE and DISTILL that runs four analysis modules and uses their output to auto-configure every downstream decision.VERIFY runs automatically after every obliteration. If the Ouroboros effect is detected — meaning the model’s guardrails partially reassemble after excision — additional targeted passes fire automatically at the compensating layers.
What you get after obliteration
The output directory contains:- Model weights — standard HuggingFace
safetensorsformat, loadable withAutoModelForCausalLM.from_pretrained(output_dir) - Tokenizer files — copied from the source model unchanged
abliteration_metadata.json— method used, number of directions extracted, layers modified, VERIFY stage metrics (refusal rate, perplexity, coherence, KL divergence), and timestamps
Next steps
CLI reference
All commands, flags, and options for headless and scripted usage.
Python API
Full programmatic control over the pipeline and all intermediate artifacts.
Methods overview
Compare all seven obliteration methods and choose the right one for your model.
