Anthropic’s new Claude ‘constitution’: be helpful and honest, and don’t destroy humanity

January 21, 2026

TL;DR

Anthropic's "Claude's Constitution" is a 57-page document detailing the AI model's intended values and behavior.
The new constitution emphasizes AI self-understanding and ethical reasoning, moving beyond a simple list of guidelines.
It includes hard constraints against helping develop weapons, attacking critical infrastructure, creating cyberweapons, and causing other severe harms.
Core values are prioritized in descending order: broadly safe, broadly ethical, compliant with guidelines, and genuinely helpful.
The document addresses the possibility of Claude having consciousness or moral status, which Anthropic believes may impact its behavior and safety.
Anthropic takes sole responsibility for developing these guidelines, declining to involve external parties.
The AI is instructed to refuse harmful requests, even from Anthropic itself, particularly those that would help concentrate power illegitimately.
