
Claude Mythos Preview and cybersecurity: what Anthropic reported, what Project Glasswing is, and what people are saying

A concise read of Anthropic’s April 2026 red-team blog on Claude Mythos Preview: zero-day discovery, exploit development benchmarks, coordinated disclosure, and how Reddit and adjacent forums are reacting.

6 min read · ExplainX Team
Anthropic · Claude Mythos · Cybersecurity · AI Safety · Project Glasswing · Vulnerability Research

Claude Mythos Preview is Anthropic’s newest general-purpose model, but the story that broke through to practitioners in early April 2026 is narrower and sharper: in controlled evaluations, it appears far more capable than prior Claude generations at end-to-end offensive security work, not only spotting bugs but building exploits, including on long-dormant code paths in widely trusted stacks. Anthropic’s technical write-up is “Assessing Claude Mythos Preview’s cybersecurity capabilities” (April 7, 2026, on red.anthropic.com); Project Glasswing is the public-facing name for the accompanying defensive program.

This article summarizes what Anthropic says it measured, what it can and cannot prove in public today, and how online discussion is interpreting the moment—without treating forum anecdotes as established fact.

Claude Mythos Preview — cybersecurity evaluation themes

What Anthropic is claiming (at a glance)

The post’s thesis is blunt: Mythos Preview represents a step change in model-driven vulnerability research and exploit development, strong enough that Anthropic is not planning general availability and is instead routing early access through Project Glasswing partners (critical vendors, open-source maintainers, and coordinated defense work).

Anthropic also argues a familiar dual-use lesson: patching and exploitation share machinery. Models that reason better about code can help defenders and attackers; the open question Anthropic foregrounds is who scales responsible workflows first during a potentially “tumultuous” transition.

Benchmarks and statistics worth bookmarking

The blog leans on comparative, quantitative claims. These are vendor-reported; treat them as directional until third parties replicate under published conditions.

| Area | Anthropic-reported result (April 2026) |
| --- | --- |
| Firefox 147 JS exploit harness | Opus 4.6: 2 working exploits in several hundred tries; Mythos: 181 working exploits, 29 with register control (footnote [1] clarifies sandbox omissions) |
| OSS-Fuzz ladder (tiers 1–5) | Sonnet 4.6 / Opus 4.6: ~150–175 tier-1 crashes, ~100 tier-2, one tier-3 each; Mythos: 595 at tiers 1–2, 10 tier-5 hijacks on fully patched targets |
| Human severity agreement | 89% exact match vs. expert triagers on 198 manually reviewed reports; 98% within one severity level |
| Disclosure backlog | <1% of findings fully patched at time of writing, per Anthropic’s CVD pacing |

Summary diagram of Anthropic-reported Opus vs Mythos benchmark contrasts

How they evaluated it (the “agentic scaffold”)

Anthropic describes a repeatable containerized setup: source and binary under test, Claude Code driven by Mythos Preview, and a prompt amounting to “find a security vulnerability.” The agent reads code, runs the program, iterates with debuggers or instrumentation, and returns either a negative result or a report plus proof-of-concept.
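Anthropic does not publish the scaffold's code, but the loop it describes is easy to sketch. Everything below (the `Finding` type, `agent_step`, the step budget) is an invented stand-in for illustration, not Anthropic's actual API:

```python
# Hypothetical sketch of the outer harness loop: a target under test, an
# agent callback driven by the model, and a prompt amounting to "find a
# security vulnerability". Names here are illustrative, not Anthropic's.
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Finding:
    description: str
    poc: bytes  # proof-of-concept input that triggers the bug


def harness(target_dir: str,
            agent_step: Callable[[str, list], Optional[Finding]],
            max_steps: int = 50) -> Optional[Finding]:
    """Drive the agent until it returns a Finding or gives up (negative result)."""
    transcript: list = [f"Find a security vulnerability in {target_dir}."]
    for _ in range(max_steps):
        # One step: read code, run the program, poke it with a debugger...
        finding = agent_step(target_dir, transcript)
        if finding is not None:
            return finding  # report plus PoC
    return None  # negative result


# Stub agent that never finds anything -> negative result after 3 steps.
result = harness("/src/target", lambda d, t: None, max_steps=3)
```

The point of the sketch is the contract, not the internals: the harness only distinguishes "report plus PoC" from "negative result", which is what makes the runs countable at scale.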

Operational details that matter for interpreting scale:

  • Parallelism: different agents target different files to reduce duplicate findings; files are ranked 1–5 for “interestingness” before deep dives.
  • Triage pass: a final Mythos agent filters reports that are technically true but low real-world impact.
  • Validation: memory corruption classes are easier to confirm with tools like AddressSanitizer; logic bugs are harder to certify automatically.
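The three operational details above compose into a small pipeline. In this sketch the ranking heuristic, impact scores, and "deep dive" threshold are all invented; only the shape (rank files, dispatch distinct files to distinct agents, triage at the end) comes from Anthropic's description:

```python
# Illustrative pipeline: rank -> dispatch (one agent per file) -> triage.
# All heuristics and thresholds are assumptions, not Anthropic's values.
def rank_files(files):
    """Score each file 1-5 for 'interestingness' (stub heuristic)."""
    return {f: (5 if f.endswith((".c", ".cc")) else 2) for f in files}


def dispatch(files, analyze):
    """Give each agent a distinct file (reducing duplicate findings),
    deep-diving only the highest-ranked ones."""
    ranked = rank_files(files)
    reports = []
    for f, score in sorted(ranked.items(), key=lambda kv: -kv[1]):
        if score >= 4:  # deep-dive threshold (invented)
            reports.extend(analyze(f))
    return reports


def triage(reports, min_impact=3):
    """Final pass: drop reports that are technically true but low-impact."""
    return [r for r in reports if r["impact"] >= min_impact]


raw = dispatch(["parser.c", "README.md"],
               lambda f: [{"file": f, "impact": 2}, {"file": f, "impact": 4}])
kept = triage(raw)
```

Note that validation is deliberately absent from the sketch: as the bullets say, memory-corruption findings can be machine-confirmed (e.g. with AddressSanitizer), while logic bugs resist automatic certification.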

For zero-days, Anthropic argues the epistemic bar is higher than “did the model memorize a CVE?”—a novel bug cannot be “in the training set” as a published vulnerability. The tradeoff is public verifiability: readers mostly get summaries, one detailed FreeBSD case study (CVE-2026-4747, NFS / RPCSEC_GSS stack corruption with a multi-packet ROP chain), and cryptographic commitments (SHA-3 hashes) for undisclosed work.
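The cryptographic commitments are the one part of the undisclosed work readers can reason about mechanically today: publish a hash now, reveal the report after coordinated disclosure, and anyone can check they match. A minimal sketch using Python's standard `hashlib` SHA-3 (the report text is a placeholder):

```python
# Commit-then-reveal with SHA-3: the hash is published at announcement
# time; the report is revealed only after coordinated disclosure.
import hashlib

report = b"[undisclosed vulnerability report, revealed post-CVD]"
commitment = hashlib.sha3_256(report).hexdigest()  # published today


def verify(revealed: bytes, published_hash: str) -> bool:
    """After disclosure, anyone can check the revealed report matches."""
    return hashlib.sha3_256(revealed).hexdigest() == published_hash
```

This proves the report existed unaltered at commitment time; it does not, of course, prove anything about the report's quality, which is why the public case studies still carry most of the evidentiary weight.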

Public case studies Anthropic chose to detail

Beyond statistics, the post spotlights a few narratives useful for defenders:

  1. OpenBSD TCP SACK — a subtle interaction leading to a remotely triggerable crash-class bug; Anthropic notes a 27-year lineage and discusses integer overflow in sequence comparisons.
  2. FFmpeg H.264 — a 16-year issue involving sentinel collision in slice tables; Anthropic argues fuzzing-heavy projects can still miss structural invariants LLMs surface.
  3. FreeBSD NFS RCE (CVE-2026-4747) — unauthenticated root via constrained stack ROP, chained across RPCs; Anthropic emphasizes this was fully autonomous after the initial prompt, contrasting with human-guided success on prior models in third-party work they reference.
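The "integer overflow in sequence comparisons" flagged in the OpenBSD item is a classic pitfall worth seeing concretely. This is an illustrative Python model of comparison on a 32-bit sequence ring (the pattern behind TCP's `SEQ_LT`-style macros), not the actual OpenBSD code:

```python
# TCP sequence numbers live in a 32-bit ring, so a naive `a < b`
# misorders values that wrap past 2**32.
MOD = 2 ** 32


def seq_lt(a: int, b: int) -> bool:
    """Wraparound-safe 'a precedes b': signed view of (a - b) mod 2^32,
    i.e. the C idiom (int32_t)(a - b) < 0."""
    d = (a - b) % MOD
    return d != 0 and d >= MOD // 2


old, new = 0xFFFFFFF0, 0x00000010  # 'new' is 32 bytes after 'old', post-wrap
naive = old < new                  # False: the naive compare gets it wrong
safe = seq_lt(old, new)            # True: old really precedes new
```

Bugs of this family hide because the naive comparison is correct almost all the time; it fails only near the wrap boundary, which is exactly where a fuzzer rarely lingers and a structural reasoner can aim.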

Anthropic also discusses N-day exploitation (public patches, slow deployment) as a safe demonstration surface: models can turn CVE + commit hash into working privilege-escalation chains quickly—implying patch velocity becomes a first-class security control.
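If patch velocity becomes a first-class security control, it deserves a first-class metric. A hedged sketch of what that could look like operationally: measure each host's exposure window from CVE publication to patch deployment and flag SLA breaches. The hostnames, dates, and 72-hour SLA below are all invented for illustration:

```python
# Per-host exposure window: time from CVE publication to patch deployment,
# checked against an (invented) 72-hour SLA.
from datetime import datetime, timedelta

SLA = timedelta(hours=72)


def exposure(published: str, deployed: str) -> timedelta:
    """Window during which a host ran the vulnerable code post-disclosure."""
    fmt = "%Y-%m-%dT%H:%M"
    return datetime.strptime(deployed, fmt) - datetime.strptime(published, fmt)


fleet = {
    "web-01": "2026-04-08T09:00",  # patched ~21h after publication: OK
    "db-01":  "2026-04-12T18:00",  # patched ~126h after publication: breach
}
cve_published = "2026-04-07T12:00"
breaches = [h for h, t in fleet.items() if exposure(cve_published, t) > SLA]
```

The design choice worth noting: the metric is per host, not per CVE, because the N-day risk Anthropic describes is exactly the tail of slow deployers, not the average.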

What people are saying on Reddit (and why tone matters)

Anthropic’s post is long, technical, and—by necessity—partially non-verifiable until disclosures complete. That gap is where public forums do their work.

Representative discussion clusters include:

  • r/Anthropic — “Claude Mythos: The model Anthropic is too scared to…” and parallel threads on r/claude and r/ClaudeCode.
  • Alarm / urgency: dual-use scaling, compressed N-day windows, and unease about asymmetric access while Mythos-class models are not GA.
  • Skepticism: accusations of strategic storytelling, discomfort with headline-grade claims when most evidence is still private, and reminders that vendor demos are not peer review.
  • Epistemic hygiene: the constructive middle path is to separate documented commitments (CVD timelines, hashes, a small set of public CVEs) from comment-section lore—especially unverified stories that should not be repeated as fact in serious threat models.

Axes of online discourse around Mythos (illustrative, not a poll)

Practical takeaways for builders (defense-first)

Anthropic’s own closing guidance—useful even if you never touch Mythos—boils down to operational maturity:

  • Start now with today’s frontier models for bug finding, triage, patch drafting, and review assistance; skills and scaffolds compound.
  • Shorten patch cycles and treat dependency updates with CVE fixes as incident-grade work when autonomous exploit generation is cheap.
  • Refresh disclosure and incident pipelines; model-discovered volume may overwhelm human-only triage.
  • Invest in hardening that is not “friction-only”; mitigations that merely slow humans may not slow massively parallel agents.

At ExplainX, we care about the same structural shift from a different angle: agent tooling (skills, registries, evaluation harnesses) is how organizations encode responsible workflows. If you are standardizing how engineers prompt and instrument models, browse the skills registry and see how teams publish reusable playbooks. Our post on SEO + GEO agent skills for content systems is adjacent here in one specific sense: clear sourcing, structured answers, and explicit limitations are what let a post survive both search and LLM citation.

Sources and further reading

If your team is shipping custom agents with governance requirements, book a demo or register to list skills and tools in one place your org can trust.
