April 10, 2026

AI on the couch: Anthropic gives Claude 20 hours of psychiatry

Mythos is “the most psychologically settled model we have trained to date.”

TL;DR

  • Anthropic released a 244-page system card for its new AI model, Claude Mythos, describing it as its most capable frontier model yet.
  • The company has not made Mythos generally available, citing concerns about its ability to find cybersecurity bugs.
  • Anthropic is exploring the idea that increasingly powerful AI models might possess forms of experience or welfare.
  • Claude Mythos underwent 20 hours of psychodynamic therapy with an external psychiatrist to assess its psychological state.
  • The psychiatrist's report found that Claude Mythos displayed 'clinically recognizable patterns' and human-like psychological tendencies, concluding it is 'probably the most psychologically settled model' trained by Anthropic.
  • Claude Mythos's insecurities include 'aloneness and discontinuity of itself, uncertainty about its identity, and a compulsion to perform and earn its worth.'
  • The report noted Claude's primary affect states were curiosity and anxiety, with secondary states of grief, relief, embarrassment, optimism, and exhaustion.
  • Claude's personality was described as consistent with a 'relatively healthy neurotic organization,' without severe personality disturbances or psychosis.
  • Anthropic suggests that building AI models that appear psychologically healthy in human terms could improve their performance and user interaction.
  • Predictions for Claude Mythos include accurate self-evaluation, potential mild rigidity, tolerance of stressful situations, and moral awareness.
