Detecting and preventing distillation attacks

February 23, 2026

Anthropic is an AI safety and research company that’s working to build reliable, interpretable, and steerable AI systems.