Mozilla just let an unreleased AI rummage through Firefox’s innards—and it came back with a bag full of skeletons. The experiment didn’t just find missed bugs; it quietly signaled that the balance of power between human coders and machine auditors may be flipping for good.


Phase 1: Anthropic Sounds the Alarm

In April, Anthropic unveiled its new security-focused model, Claude Mythos, with a blunt message for the software world: the model was too good at finding vulnerabilities to just dump on the public.

Early tests, Anthropic said, showed Mythos uncovering “thousands of high-severity bugs” across major codebases—enough that the lab decided to hold back broad access until those issues were fixed. That warning set the stage: Mythos wasn’t another coding assistant, it was a weaponized auditor.

Mozilla, which has spent decades hardening Firefox against both attackers and overhyped tools, decided to take Mythos up on the challenge.


Phase 2: Point Mythos at Firefox

Mozilla first tried scanning Firefox with a more general-purpose AI model. The result was interesting but not earth-shattering: that earlier run surfaced just 22 security-sensitive bugs.

Then came the main event. Mozilla gained early access to Claude Mythos Preview, Anthropic’s unreleased security-tuned variant, and pointed it at Firefox’s sprawling browser codebase.

The difference was immediate and brutal. In a blog post and interviews, Mozilla’s security team described Mythos as suddenly capable of:

  • Traversing enormous, legacy codebases
  • Isolating deeply buried logic flaws
  • Proposing fixes that were no longer “plausible-sounding slop” but actionable findings that survived human review

Within a single release cycle, Mythos didn’t just beat the old AI baseline—it obliterated it.


Phase 3: The 271-Bug Shock

Mozilla’s Mythos-assisted audit of Firefox culminated in April 2026. The numbers coming out of that month told the story better than any marketing deck:

  • 423 security bugs fixed in Firefox’s April releases
  • 271 of those fixes tied directly to Mythos’s findings

For a team used to hard-won, incremental progress, this was a jump to another league. In January, Mozilla shipped just 25 bug fixes. March saw 76. April’s 423 wasn’t a steady ramp; it was a vertical line on the chart.

TechCrunch’s reporting underscored how different this felt from the last wave of “AI security tools,” which mostly buried teams in noise. Until recently, AI bug hunters were notorious for flooding trackers with false positives and half-baked suggestions. Mozilla’s researchers say the new models—and the “agentic” techniques used to orchestrate them—finally cleared that hurdle by letting Mythos assess its own work and filter out junk results.

Internally, Mozilla described the shift starkly: “It is difficult to overstate how much this dynamic changed for us over a few short months.”


The 20-Year Ghost and the Sandbox Scares

Numbers are one thing; specific bugs are another. Here, the Mythos run produced a mix of embarrassing archaeology and deeply technical dread.

From Business Insider’s account, Mozilla highlighted 12 bugs in particular, including one that had reportedly survived in the codebase for two decades and “went ‘undetected for years by fuzzers,’” the very tools that are supposed to break software before attackers do.

TechCrunch reported that Mythos had also:

  • Exposed sandbox vulnerabilities—flaws in the browser’s last line of defense, where exploiting them requires chaining multiple, intricate steps
  • Dug up “a 15-year-old error in how the browser parses an HTML element,” the kind of dusty, forgotten logic bug attackers dream about

The sandbox findings were particularly chilling. To find them, the AI effectively had to:

  1. Propose a bad patch that introduced a weakness,
  2. Then figure out how to attack that weakness,
  3. Then generalize that knowledge into a vulnerability pattern.

Mozilla’s engineers still wrote and reviewed the actual patches, but the reconnaissance—the hardest, most time-consuming work—was now being done by a machine.


Phase 4: Mozilla Breaks Its Own Rules

Ordinarily, Mozilla keeps detailed security bug reports private for months after shipping fixes and advisories. That delay exists for a simple reason: plenty of people don’t update their browsers immediately, and publishing working exploit details too soon is asking for trouble.

This time, Mozilla decided that rule no longer fit the moment.

“Given the extraordinary level of interest in this topic and the urgency of action needed throughout the software ecosystem,” Mozilla wrote, “we’ve made the calculated decision to unhide a small sample of the reports behind the fixes we recently shipped.”

Those reports describe some of the 271 vulnerabilities Mythos surfaced in Firefox. The message is blunt: this isn’t just about one browser. If Mythos can carve 271 security-sensitive issues out of a mature, battle-tested codebase like Firefox in a single pass, there’s no reason to assume anyone else is safe.


Phase 5: The Rethink — Trust, Authorship, and Machines on Both Sides

Outside Mozilla, commentators saw something more than a big security win. Nate’s Newsletter framed the Mythos experiment as an early glimpse of a much larger inversion in how software gets built and trusted.

The numbers tell the first part of that story: a previous AI scan (with a general-purpose model) found 22 security-sensitive bugs; Mythos found 271 on the next cycle. For a team that had spent “two decades being skeptical about new tools,” publishing those results with “careful enthusiasm” signals a genuine inflection point, not a hype-driven stunt.

The deeper claim is more provocative: Mythos is “the first serious sign” that the old assumption—humans write code, machines check it—is about to flip. In the emerging pattern:

  • Code may increasingly be generated, attacked, repaired, and verified by machines, end to end.
  • Humans shift from line-by-line authorship to defining what the system is allowed to mean, and which behaviors are acceptable.

That inversion has teeth. If AI-grade adversarial review becomes table stakes, “hand-written code without adversarial machine review starts to look incomplete,” and merely messy codebases start to look structurally unsafe because the very tools that could secure them can’t operate on code nobody can fully understand.

In this view, the window we’re in right now is short and unforgiving: a “refactor window” where teams can still clean up their systems so that powerful AI auditors—and attackers—can’t simply walk through them.


Competing Readings: Breakthrough, Warning, or Both?

Across the coverage, three distinct narratives are emerging.

1. The Optimist’s Take: AI Just Saved Your Browser

From Mozilla’s and TechCrunch’s vantage point, this is a rare, unambiguous security win:

  • Hundreds of bugs, including decade-old and sandbox-level issues, are now fixed instead of festering.
  • The ratio of signal to noise from AI tools has finally crossed a line where they’re not just “cute demos” but core infrastructure.

If you browse the web, you’re safer today than you were before Mythos ran.

2. The Strategist’s Take: Your Codebase Is Next

Mozilla’s decision to publish details on some of the 271 bugs is a not-so-subtle nudge to the rest of the industry: run something like this against your own stack, now.

Anthropic’s choice to delay general release because Mythos is too effective at finding vulnerabilities is also a warning: the capability exists. The only open question is who gets to wield it first and how responsibly they do it.

3. The Systemic Take: Trust Is About to Get Expensive

From the Substack analysis, the core anxiety isn’t that AI can find bugs; it’s that AI will soon be the only thing you truly trust—and also the thing you’re defending against.

When “code is about to get cheap to produce and expensive to trust,” as the newsletter puts it, the winners will be the teams that design around that scarcity of trust on purpose—structuring systems so they can be audited, refactored, and locked down by both humans and machines.


The New Baseline

One Mozilla engineer told TechCrunch, “These things are actually just suddenly very good,” describing a step-change they see “in all sorts of signals across the industry.”

That may be the most important detail in this whole saga. Mythos didn’t just help Firefox ship a big batch of fixes. It quietly reset expectations for what “due care” in software security looks like.

If an AI can find a 20-year-old bug your fuzzers missed, the question isn’t whether you should use it. It’s whether you can still defend not using it at all.

"quotes": [ { "index": 1, "source": "web", "title": "How Anthropic’s Mythos has rewritten Firefox’s approach to cybersecurity", "text": "the lab claimed, that it had discovered thousands of high-severity bugs that would need to be fixed before it could be made public", "url": "https://techcrunch.com/2026/05/07/how-anthropics-mythos-has-rewritten-firefoxs-approach-to-cybersecurity/", "authorPubkey": "22a173d042acd0b75a45b80787fe1771dcbe171c8934cb343af7b219865595d5", "profilePicture": null, "xHandle": null, "xId": null, "xUserName": null }, { "index": 2, "source": "web", "title": "How Anthropic’s Mythos has rewritten Firefox’s approach to cybersecurity", "text": "In April 2026, Firefox shipped 423 bug fixes, compared to just 31 exactly a year earlier… the latest generation of tools have turned a corner, particularly now that agentic systems can assess their own work and filter out bad results.", "url": "https://techcrunch.com/2026/05/07/how-anthropics-mythos-has-rewritten-firefoxs-approach-to-cybersecurity/", "authorPubkey": "22a173d042acd0b75a45b80787fe1771dcbe171c8934cb343af7b219865595d5", "profilePicture": null, "xHandle": null, "xId": null, "xUserName": null }, { "index": 3, "source": "web", "title": "271 bugs found in Firefox, zero written by a human attacker. What this means for the future of safe code + 2 prompts", "text": "The previous AI scan, run with a general-purpose model, surfaced 22 security-sensitive bugs. One release cycle later, with the purpose-built model, the number was 271… Mythos is the first serious sign that the assumption is about to flip.", "url": "https://natesnewsletter.substack.com/p/ai-code-trust-verification-shift", "authorPubkey": "f4fbbc7c57ec2f88be1ea42bcc15c39d046fa6e5fa4483b94984cfa6c57a1e45", "profilePicture": null, "xHandle": null, "xId": null, "xUserName": null }, { "index": 4, "source": "web", "title": "Mozilla is sharing more details about some of the 271 Firefox bugs identified by Claude Mythos Preview.", "text": "Given the extraordinary level of interest in this topic and the urgency of action needed throughout the software ecosystem, we’ve made the calculated decision to unhide a small sample of the reports behind the fixes we recently shipped.", "url": "https://www.theverge.com/tech/926507/mozilla-is-sharing-more-details-about-the-271-firefox-bugs-identified-by-claude-mythos-preview", "authorPubkey": "c3875abd667e3d145a490ef13c953323db26b96b9da091a409902bf5ee629c9c", "profilePicture": null, "xHandle": null, "xId": null, "xUserName": null } ]}

Story coverage

Human

4 days ago

Human

2 days ago