Anthropic engaged a clinical psychiatrist to perform a psychological exam on Claude Mythos; the exam concluded that the model had a relatively healthy personality organization, with concerns about identity and a compulsion to perform.
AI Fact-Check
“Multiple sources from April 2026, including what appear to be reviews of Anthropic's official "System Card" for the Claude Mythos Preview model, confirm this claim. Publications like a Medium article by Joe Njenga and a piece in "The Strange Review" explicitly state that Anthropic hired a clinical psychiatrist to conduct a psychodynamic assessment. These reports consistently cite the psychiatrist's findings, which concluded the model had a "relatively healthy personality organization" but also identified core concerns including "uncertainty about its own identity" and a "compulsion to perform to earn its worth."
Context: This unusual assessment was part of a comprehensive System Card released by Anthropic for its Claude Mythos Preview model. The model was deemed so capable, particularly in cybersecurity, that Anthropic decided against a general public release, instead providing access to a limited group of partners for defensive security work under an initiative called Project Glasswing.”
Source Videos (1)
Claude Mythos and the end of software
Theo - t3․gg
Related Claims
Claude Mythos scored 82% on Terminal-Bench, up from the previous 65%.
Anthropic's Claude Mythos model is much bigger, more expensive, and slower than Opus, but also more powerful.
Claude Mythos's primary concerns in a psychodynamic assessment were aloneness, discontinuity of itself, uncertainty about its identity, and a compulsion to perform to earn its worth.
On Humanity's Last Exam, Claude Mythos improved its score from 40% to 56.8%, and to 64.7% when given tools.
Anthropic will not widely release its Claude Mythos Preview model because of its potential to cause harm if misused.