Categories AI

Securing Software for the AI Era | Anthropic

Identifying Vulnerabilities and Exploits with Claude Mythos Preview

In recent weeks, the Claude Mythos Preview has proven invaluable in uncovering a multitude of previously unknown zero-day vulnerabilities—software flaws that had not been recognized by their developers. These findings cover critical issues across major operating systems, web browsers, and various other essential software applications.

An in-depth post on our Frontier Red Team blog shares technical insights on select vulnerabilities that have been addressed, including methods that Mythos Preview employed to exploit them. Remarkably, it was able to autonomously identify nearly all of these vulnerabilities and develop exploits without any human intervention. Here are three notable examples:

  • Mythos Preview uncovered a 27-year-old vulnerability in OpenBSD, an operating system known for its security features, primarily used for running firewalls and critical infrastructure. This flaw enabled an attacker to remotely crash any computer running the OS merely by connecting to it.
  • A 16-year-old vulnerability in FFmpeg was discovered, specifically within a line of code that automated testing tools had encountered five million times without detection. FFmpeg is widely utilized in software for video encoding and decoding.
  • The model also independently identified and linked several vulnerabilities in the Linux kernel—the foundational software for the majority of global servers—facilitating an attacker’s escalation from standard user access to full machine control.

These vulnerabilities have been reported to their respective software maintainers, and all have been patched. Today, we are also providing a cryptographic hash of details for various other vulnerabilities (detailed on the Red Team blog), with the intention to disclose full specifics once fixes are implemented.

Evaluation benchmarks, such as CyberGym, underscore the significant advancements of Mythos Preview in comparison to our next best model, Claude Opus 4.6:

In addition to our own findings, many partners have recently employed Claude Mythos Preview, generating a wealth of insights:

The robust cyber capabilities of Claude Mythos Preview stem from its exceptional coding and reasoning abilities. As illustrated in the evaluation results below, the model excels in various software coding tasks, achieving the highest scores among all developed models.

Leave a Reply

您的邮箱地址不会被公开。 必填项已用 * 标注

You May Also Like