Securing Software for the AI Era | Anthropic

Identifying Vulnerabilities and Exploits with Claude Mythos Preview

In recent weeks, the Claude Mythos Preview has proven invaluable in uncovering a multitude of previously unknown zero-day vulnerabilities—software flaws that had not been recognized by their developers. These findings cover critical issues across major operating systems, web browsers, and various other essential software applications.

An in-depth post on our Frontier Red Team blog shares technical insights on select vulnerabilities that have been addressed, including methods that Mythos Preview employed to exploit them. Remarkably, it was able to autonomously identify nearly all of these vulnerabilities and develop exploits without any human intervention. Here are three notable examples:

Mythos Preview uncovered a 27-year-old vulnerability in OpenBSD, an operating system known for its security features, primarily used for running firewalls and critical infrastructure. This flaw enabled an attacker to remotely crash any computer running the OS merely by connecting to it.
A 16-year-old vulnerability in FFmpeg was discovered, specifically within a line of code that automated testing tools had encountered five million times without detection. FFmpeg is widely utilized in software for video encoding and decoding.
The model also independently identified and linked several vulnerabilities in the Linux kernel—the foundational software for the majority of global servers—facilitating an attacker’s escalation from standard user access to full machine control.

These vulnerabilities have been reported to their respective software maintainers, and all have been patched. Today, we are also providing a cryptographic hash of details for various other vulnerabilities (detailed on the Red Team blog), with the intention to disclose full specifics once fixes are implemented.

Evaluation benchmarks, such as CyberGym, underscore the significant advancements of Mythos Preview in comparison to our next best model, Claude Opus 4.6:

Cybersecurity Vulnerability Reproduction

In addition to our own findings, many partners have recently employed Claude Mythos Preview, generating a wealth of insights:

“AI capabilities have crossed a threshold that fundamentally changes the urgency required to protect critical infrastructure from cyber threats, and there is no going back. Our foundational work with these models has shown we can identify and fix security vulnerabilities across hardware and software at a pace and scale previously impossible. That is a profound shift, and a clear signal that the old ways of hardening systems are no longer sufficient.
Providers of technology must aggressively adopt new approaches now, and customers need to be ready to deploy. That is why Cisco joined Project Glasswing—this work is too important and too urgent to do alone.”

“At AWS, we build defenses before threats emerge, from our custom silicon up through the technology stack. Security isn’t a phase for us; it’s continuous and embedded in everything we do. Our teams analyze over 400 trillion network flows every day for threats, and AI is central to our ability to defend at scale.
We’ve been testing Claude Mythos Preview in our own security operations, applying it to critical codebases, where it’s already helping us strengthen our code. We’re bringing deep security expertise to our partnership with Anthropic and are helping to harden Claude Mythos Preview so even more organizations can advance their most ambitious work with security that sets the standard.”

“As we enter a phase where cybersecurity is no longer bound by purely human capacity, the opportunity to use AI responsibly to improve security and reduce risk at scale is unprecedented. Joining Project Glasswing, with access to Claude Mythos Preview, allows us to identify and mitigate risk early and augment our security and development solutions so we can better protect customers and Microsoft.
When tested against CTI-REALM, our open-source security benchmark, Claude Mythos Preview showed substantial improvements compared to previous models. We look forward to partnering with Anthropic and the broader industry to improve security outcomes for all.”

Igor Tsyganskiy

EVP of Cybersecurity and Microsoft Research, Microsoft

Read announcement

“The window between a vulnerability being discovered and being exploited by an adversary has collapsed—what once took months now happens in minutes with AI.
Claude Mythos Preview demonstrates what is now possible for defenders at scale, and adversaries will inevitably look to exploit the same capabilities. That is not a reason to slow down; it’s a reason to move together, faster. If you want to deploy AI, you need security. That is why CrowdStrike is part of this effort from day one.”

“In the past, security expertise has been a luxury reserved for organizations with large security teams. Open-source maintainers—whose software underpins much of the world’s critical infrastructure—have historically been left to figure out security on their own.
By giving the maintainers of these critical open-source codebases access to a new generation of AI models that can proactively identify and fix vulnerabilities at scale, Project Glasswing offers a credible path to changing that equation. This is how AI-augmented security can become a trusted sidekick for every maintainer, not just those who can afford expensive security teams.”

“Promoting the cybersecurity and resiliency of the financial system is central to JPMorganChase’s mission, and we believe the industry is strongest when leading institutions work together on shared challenges. Project Glasswing provides a unique, early-stage opportunity to evaluate next-generation AI tools for defensive cybersecurity across critical infrastructure both on our own terms and alongside respected technology leaders.
We will take a rigorous, independent approach to determining how to proceed and where we can help. Anthropic’s initiative reflects the kind of forward-looking, collaborative approach that this moment demands.”

Pat Opet

Chief Information Security Officer, JPMorganChase

“Google is pleased to see this cross-industry cybersecurity initiative coming together and to make Mythos Preview available to participants via Vertex AI. It’s always been critical that the industry work together on emerging security issues, whether it’s post-quantum cryptography, responsible zero-day disclosure, secure open-source software, or defense against AI-based attacks.
We have long believed that AI poses new challenges and opens new opportunities in cyber defense, which is why we’ve built AI-powered tools—such as Big Sleep and CodeMender—to find and fix critical software flaws. We will continue investing in our leading cybersecurity platform and a culture focused on protecting users, customers, the ecosystem, and national security.”

“Over the past few weeks, we’ve had access to the Claude Mythos Preview model, using it to identify complex vulnerabilities that prior-generation models missed entirely. This is not only a game changer for finding previously hidden vulnerabilities, but it also signals a dangerous shift where attackers can soon find even more zero-day vulnerabilities and develop exploits faster than ever before.
It’s clear that these models need to be in the hands of open-source owners and defenders everywhere to find and fix these vulnerabilities before attackers get access. Perhaps even more important: everyone needs to prepare for AI-assisted attackers. There will be more attacks, faster attacks, and more sophisticated attacks. Now is the time to modernize cybersecurity stacks everywhere. We commend Anthropic for partnering with the industry to ensure these powerful capabilities prioritize defense first.”

The robust cyber capabilities of Claude Mythos Preview stem from its exceptional coding and reasoning abilities. As illustrated in the evaluation results below, the model excels in various software coding tasks, achieving the highest scores among all developed models.

Mythos Preview without tools

Mythos Preview with tools

For further insights about the model’s capabilities, safety properties, and attributes, please refer to the Claude Mythos Preview system card.

While we currently have no plans to make Claude Mythos Preview broadly available, our ultimate objective is to facilitate secure deployment of Mythos-class models at scale—both for cybersecurity purposes and the diverse benefits these advanced models can offer. Achieving this requires ongoing development of cybersecurity safeguards to identify and mitigate the model’s most hazardous outputs. We anticipate launching new safeguards soon alongside the upcoming Claude Opus model, enabling us to enhance and refine them with a model that poses less risk than Mythos Preview³.

Plans for Project Glasswing

Today marks the inception of a long-term initiative. Its success hinges on widespread collaboration across the technology sector and beyond.

Project Glasswing partners will gain access to Claude Mythos Preview to discover and address vulnerabilities or weaknesses within their foundational systems—representing a significant portion of the global cyberattack surface. We expect this endeavor to focus on tasks such as local vulnerability detection, black box testing of binaries, endpoint security, and system penetration testing.

Anthropic’s commitment of $100 million in model usage credits to Project Glasswing and additional collaborators will cover substantial usage during this research preview. Subsequently, Claude Mythos Preview will be accessible to participants at a rate of $25/$125 per million input/output tokens (available via the Claude API, Amazon Bedrock, Google Cloud’s Vertex AI, and Microsoft Foundry).

Alongside our commitment to model usage credits, we have contributed $2.5 million to Alpha-Omega and OpenSSF through the Linux Foundation, and $1.5 million to the Apache Software Foundation to support open-source software maintainers in navigating this evolving landscape (maintainers interested in access can apply through the Claude for Open Source program).

We envision this effort expanding in scope and sustaining for many months, sharing valuable insights to help other organizations adapt their security practices. Partners will exchange information and best practices collaboratively; within 90 days, Anthropic will provide a public report detailing what we’ve learned, including vulnerabilities addressed and improvements achieved that can be disclosed. We will also work alongside leading security organizations to produce a set of practical recommendations for evolving security practices in the AI era. This may encompass:

Vulnerability disclosure processes;
Software update processes;
Open-source and supply-chain security;
Software development lifecycle and secure-by-design practices;
Standards for regulated industries;
Triage scaling and automation;
Patching automation.

Anthropic has engaged in ongoing discussions with U.S. government officials regarding Claude Mythos Preview and its cyber capabilities—both offensive and defensive. As previously mentioned, securing critical infrastructure is a paramount national security priority for democratic nations; the advent of these cyber capabilities underscores the necessity for the U.S. and its allies to maintain leadership in AI technology. Governments play a crucial role in sustaining that advantage and in evaluating and mitigating the national security risks associated with AI models. We are prepared to collaborate with local, state, and federal representatives to assist in these initiatives.

Our hope is that Project Glasswing can catalyze a larger industry and public sector effort, with all stakeholders working to tackle significant questions regarding the implications of powerful AI models on security matters. We invite other members of the AI community to join us in establishing industry standards. In the medium term, the creation of an independent, third-party organization—uniting private and public-sector entities—could serve as an ideal foundation for continued large-scale cybersecurity projects.

Identifying Vulnerabilities and Exploits with Claude Mythos Preview

Plans for Project Glasswing

Leave a Reply 取消回复

You May Also Like

I’ve Got a Hunch

Using GPT-5.6: A Guide from Ben’s Bites

Grok and Cursor Collaboration