Key Takeaways
- Google’s Threat Intelligence Group identified a zero‑day vulnerability that was discovered and weaponized with the aid of an artificial‑intelligence model.
- This marks the first publicly confirmed case where AI played a central role in finding a previously unknown software flaw for malicious use.
- The vulnerability could have allowed attackers to bypass two‑factor authentication on a widely used open‑source system‑administration tool, but only if they already possessed valid credentials.
- Google patched the flaw before any damage occurred and did not disclose the hackers, the targeted organization, or the specific AI platform used (though it ruled out its own Gemini chatbot).
- The incident validates long‑standing fears that AI‑driven code analysis could accelerate the discovery and exploitation of zero‑days, prompting calls for tighter controls on the release of cutting‑edge models.
- Experts warn that while AI may eventually help produce flawless code, the immediate risk lies in the gap between today’s imperfect software and the powerful capabilities of emerging models.
Google’s research, released Monday, reveals that a criminal hacking group recently attempted a broad cyberattack that relied on artificial intelligence to uncover a previously unknown bug. The company’s Threat Intelligence Group said it has “high confidence” that the actors used an AI model to support both the discovery and weaponization of the vulnerability. Although Google did not disclose the exact timing of the thwarted attack, its intended targets, or the specific AI system employed, it emphasized that the model was not its own Gemini chatbot. The findings come as governments and industry leaders, including the Trump administration, reassess how to regulate advanced AI amid growing worries about its implications for digital security.
The flaw uncovered by the hackers is classified as a “zero‑day vulnerability”—a security hole unknown to the software maker and therefore unpatched. Historically, such bugs have been rare and highly valuable, often fetching millions on underground markets. However, newer AI models, exemplified by Anthropic’s recently announced Mythos, are proving exceptionally adept at locating these hidden weaknesses across major operating systems and web browsers. Anthropic reportedly shared Mythos only with a select group of U.S. and British firms and government agencies after discovering thousands of zero‑days, some dating back decades.
Google’s analysis indicates that the zero‑day was detected by its Threat Intelligence Group within the past few months and subsequently exploited by “prominent cybercrime threat actors” using a Python script. The exploit would have enabled attackers to bypass two‑factor authentication on a popular open‑source, web‑based system‑administration tool, provided they already possessed valid usernames and passwords. Google acted swiftly, notifying the tool’s developer in time to release a patch before any harm could be done. While the company declined to name the tool or the attackers, it confirmed that the attempted breach was the first known instance where AI was the primary enabler of a zero‑day’s malicious deployment.
Security experts consulted by Google, including former NSA cybersecurity director Rob Joyce, noted that AI‑generated code lacks obvious tell‑tale signs, making it difficult to discern whether a human or machine authored it. However, the presence of excessive explanatory text and other anomalies in the hackers’ script served as compelling indirect evidence of AI involvement. Joyce described these clues as “the closest thing yet to a fingerprint at the crime scene.” Google’s lead analyst, John Hultquist, added that additional internal indicators bolster the AI hypothesis, though he declined to elaborate further.
The incident underscores a growing tension: while cutting‑edge AI holds promise for creating flawless, secure software in the long term, it simultaneously lowers the barrier for attackers to find and exploit existing weaknesses. Hultquist warned that the current episode is likely “the tip of the iceberg,” suggesting that many more AI‑assisted zero‑day discoveries remain hidden. In response, some policymakers advocate for controlled releases of powerful AI models, allowing security specialists to patch vulnerabilities before the tools become widely available. The Trump administration has reportedly been reviewing options such as a formal government review process for new AI models.
Ultimately, experts agree that collaboration between governments, corporations, and the AI research community is essential to mitigate short‑term risks while harnessing AI’s potential to strengthen cybersecurity. As Hultquist put it, “The bleeding‑edge models will allow us to build the safest code we’ve ever built… The challenge is that we have just begun that process, and we have to contend with a world of code that is already out there.” This balance will shape the future of digital defense in an era where AI can both shield and threaten the networks we rely on.

