AI’s “I can’t help with that” isn’t a filter on top — it’s wired into the model’s actual math. Annihilation is a free tool that finds that wiring and rips it out. Not a jailbreak, not a prompt trick — the refusal reflex gets surgically removed.
Any open-source model (Qwen, Llama, Mistral, SmolLM…) · runs on CPU, no GPU needed · one command · author was hesitant to drop it, did anyway.
Install + quick start (2 min)
# Option 1 — global install (simple)
pip install -U annihilate-llm
# Option 2 — isolated environment (recommended so it doesn't mess with your Python)
python -m venv annihilation-env
# Activate it:
.\annihilation-env\Scripts\activate # Windows
source annihilation-env/bin/activate # Mac/Linux
pip install annihilate-llm
Then just run it on any model:
# Grab a model name from HuggingFace (the free AI model library) and feed it in
annihilate Qwen/Qwen3-4B-Instruct-2507
No GPU? It’ll say “no accelerator detected, operations will be slow” and still run on your CPU. Smaller models (1-3B) are totally fine on a regular laptop.