Tags: #ai-safety
AI Model Utility
pytorch
20.5k
p-e-w/heretic
Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.
Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.