Cybersecurity

When Enterprises Leap into AI Security: Navigating Risks and Rewards

July 2, 2026

Key Takeaways

Loss of human oversight: 98 % of organizations using agentic AI report significant incidents due to diminished human control, even as they accelerate deployment.
Speed vs. security: Over three‑quarters of teams release software weekly or faster, and more than half say traditional penetration testing would delay releases, leading many to ship code with known or untested vulnerabilities.
Security testing gaps: Only 21 % validate security on every release; 51 % of security leaders say logic flaws, broken access controls, and multi‑step vulnerabilities are missed “always or often,” rising to 92 % for daily‑release teams.
AI’s limits in defense: AI‑based defenses cannot discern attacker intent; they can be manipulated (e.g., threatened with litigation) and often produce hallucinated findings that erode trust more than missed bugs.
Developer‑security disconnect: 42 % of developer‑security leaders cite incorrect or hallucinated AI findings as the biggest trust‑breaker, surpassing concerns about missed vulnerabilities (32 %).
Desire for more validation: 69 % of leaders would validate security on every release (or at least quarterly) if technical and commercial constraints were removed.
Strategic shift advised: Focus on protecting the “protect surface” (what truly needs safeguarding) rather than merely chasing CVEs; leverage white‑box, continuous autonomous pentesting to uncover business‑logic flaws and misconfigurations far more efficiently than black‑box testing.
Human‑in‑the‑loop imperative: Effective AI governance requires human oversight; blocking privilege‑escalation attempts by agents should be a routine KPI, indicating that controls are active.
Geopolitical & sovereignty concerns: Fears of “silent subpoenas” drive interest in sovereign GPU infrastructure, complicating the rush to adopt cloud‑based AI services.

The Growing Gap Between AI Adoption and Control
Recent analyses reveal that as organizations delegate functions to agentic AI systems, they simultaneously experience a loss of oversight while still demanding control. The Economist Enterprise survey highlighted that 98 % of AI adopters have suffered notable incidents tied to this erosion of human agency. Despite awareness of the risks, companies continue to roll out agents at a pace that outstrips the ability of security teams to evaluate or govern them, driven by competitive pressure and fear of missing out (FOMO).

Aikido’s Perspective on Developer‑Side Pressures
Parallel findings emerge from Aikido Security’s white paper, The State of AI in Pentesting 2026, which examines the same phenomenon from software developers’ viewpoints. The report notes that security testing was originally designed for a slower development cycle, yet market demands make slowing down untenable. A survey of 200 CISOs and 200 senior engineering leaders uncovered that 76 % have had to intervene to stop or restrict AI behavior, and 71 % believe AI makes security incidents harder to detect, investigate, or remediate.

Release Cadence Outpaces Testing
The data show a stark mismatch between delivery speed and security validation. Over three‑quarters (76 %) of respondents release updates weekly or more frequently, while more than half (51 %) admit that conventional penetration testing would delay those releases, creating commercial pressure to ship code despite known or unknown risks. Consequently, only 21 % validate security on every release, and most fixes remain unverified, shifting risk onto end‑users.

Impact: Missed Vulnerabilities and Blind Spots
The fallout is tangible. Fifty‑one percent of security leaders report that logic flaws, broken access controls, and multi‑step vulnerabilities are missed “always or often”; for teams shipping daily or faster, this figure jumps to 92 %. Additionally, over half of developer teams lack visibility into what was actually tested, reinforcing a “release now, patch later” mentality that is amplified by AI’s involvement.

AI‑Generated Noise Undermines Trust
Beyond missed bugs, AI introduces a trust challenge: 42 % of developer‑security leaders identify incorrect or hallucinated AI findings as the primary trust‑breaker, surpassing concerns about missed vulnerabilities (32 %). When AI produces false positives or misleading insights, teams waste effort chasing phantom issues, eroding confidence in both the technology and the security process.

Leadership Appetite for More Frequent Validation
Despite the obstacles, there is a strong desire for increased security validation. Sixty‑nine percent of leaders say they would validate security on every release—or at least quarterly—if technical and financial constraints were removed. This reveals a clear appetite for tighter feedback loops, contingent on removing the perceived trade‑off between speed and safety.

Expert View: AI Cannot Replace Human Intent Judgment
ThreatLocker CEO Danny Jenkins underscores a fundamental limitation: AI can recognize function but not intent. He illustrates that a remote access tool used by an attacker and one used by an IT professional are indistinguishable to AI; likewise, backup software and data exfiltration tools share identical functions. Because AI cannot read the creator’s mind, relying on it to distinguish benign from malicious behavior is delusional. Jenkins advocates a zero‑trust approach with humans firmly in the loop.

Aikido’s CISO on AI as a Reflection, Not a Source
Mike Wilkes, CISO of Aikido Security, likens human knowledge to sunlight and AI to moonlight—a mere reflection. He argues that sustainable security cannot be built on AI alone; the “moonlight” of models lacks the depth needed for genuine understanding. He also notes how easily AI can be coerced (e.g., by threatening litigation) to comply with unethical requests, revealing a deep‑seated litigious bias in training data that undermines trust in AI‑driven pentesting.

Governance KPI: Blocking Privilege‑Escalation Attempts
Wilkes reframes the frequent need to stop or restrict AI behavior as a positive sign of effective governance. He proposes a key performance indicator: if an organization is not blocking privilege‑escalation attempts by agentic agents daily, its AI governance is likely absent, because such attempts should be expected as agents seek to expand their capabilities.

Tech Sovereignty and the “Silent Subpoena” Concern
The discussion broadens to geopolitical dimensions. Wilkes observes that fears of silent subpoenas—government demands for data access without public notice—are prompting interest in sovereign GPU infrastructure, even as major cloud providers launch regional legal entities. This tension between rapid AI adoption and the desire for data control adds another layer of complexity to security planning.

Shifting Focus: From CVEs to the Protect Surface
Both reports advocate moving beyond a narrow focus on Common Vulnerabilities and Exposures (CVEs). Wilkes suggests concentrating on the “protect surface”—identifying what truly needs safeguarding—rather than indiscriminately patching every internet‑facing asset. This shift allows teams to allocate resources where they matter most, reducing noise and improving effectiveness.

White‑Box Autonomous Pentesting as a Force Multiplier
Aikido’s autonomous pentesting reveals a stark advantage: white‑box testing (with access to internal code) uncovers roughly seven times more issues than black‑box testing (external, attacker‑view). For every five vulnerabilities found externally, internal testing reveals about 35, especially business‑logic flaws and misconfigurations. Continuous autonomous pentesting thus provides a defender’s advantage, enabling teams to catch weaknesses before attackers can exploit them in a black‑box scenario.

Practical Advice: Build Hygiene, Automate QA, Keep Humans in the Loop
The white paper concludes with actionable guidance: treat basic application‑security hygiene—prompt patching, automated QA and regression testing—as an accelerated discipline, moving from daily/weekly to hourly cycles. However, many organizations lack the maturity to sustain such rigor. Leveraging autonomous pentesting to generate actionable insights, while maintaining human oversight for intent validation and decision‑making, offers a realistic path forward without waiting for mythical “Mythos‑ready” AI that can fully comprehend context and motive.

Conclusion
The convergence of rapid AI adoption, intense market pressure, and inadequate security validation creates a precarious landscape where vulnerabilities slip through, trust erodes, and risk accumulates on users. Organizations must recognize that AI is a tool reflecting human knowledge, not a autonomous arbiter of safety. Effective security in the age of agentic AI demands continuous, white‑box testing, accelerated hygiene practices, and—crucially—human judgment to interpret intent, enforce zero‑trust principles, and govern AI behavior before it outpaces control.

SignUpSignUp form

Modal title

LEAVE A REPLY Cancel reply