In early internal use, Claude Mythos Preview reached unprecedented levels of reliability and alignment, which led to broader use with less frequent human interaction than with prior models.
other
Videos: 1
Confidence: 90%
First Seen: 4/10/2026
Last Seen: 4/10/2026
Source Videos (1)
Claude Mythos and the end of software
Theo - t3․gg
8:47
Related Claims
Claude Mythos scored 82% on Terminal-Bench, up from the previous 65%.
other · 1 video
Anthropic's Claude Mythos model is much bigger, slower, and more expensive than Opus, but also more powerful.
other · 1 video
On Humanity's Last Exam, Claude Mythos improved its score from 40% to 56.8%, and to 64.7% when given tools.
other · 1 video
Anthropic will not widely release its Claude Mythos Preview model because of its potential to cause harm if misused.
other · 5 videos
Anthropic has been using the Claude Mythos model internally since February 24th, 2026.
other · 1 video