Twenty Dollars to Hack McKinsey: The Week AI Agents Became the Attackers
An autonomous AI agent hacked McKinsey's AI platform in 2 hours for $20 in tokens. Hackerbot-Claw compromised 47,391 GitHub repos autonomously. APT36 is mass-producing malware with AI coding tools. Anthropic sued the Pentagon. And MCP hit 30 CVEs in 60 days.
Safe AI Academy · March 14, 2026 · 16 min read
The $20 Hack: When AI Agents Do the Pentesting (and the Attacking)
CodeWall is an autonomous AI pentesting agent. It was pointed at Lilli, McKinsey's enterprise AI chatbot used by consultants globally. Within two hours, and for the cost of a lunch, it found a SQL injection vulnerability hiding in JSON field names, a spot that traditional SAST tools almost never check. From there it pivoted to full database access: 46.5 million internal messages, 728,000 files, 57,000 accounts, and write access to Lilli's system prompts, which means it could have poisoned every future interaction the chatbot had with McKinsey's workforce.
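To make that bug class concrete, here is a minimal Python sketch of why SQL injection via JSON field names slips past taint-tracking tools. This is an illustrative pattern, not CodeWall's actual finding or Lilli's code; the function names and the `ALLOWED_FIELDS` schema are mine.

```python
def build_update_unsafe(table: str, payload: dict):
    # Vulnerable pattern: the JSON *keys* are interpolated into the SQL text.
    # The placeholders protect the values, so SAST rules that track tainted
    # values (not field names) often pass this as "properly parameterized".
    cols = ", ".join(f"{key} = ?" for key in payload)
    return f"UPDATE {table} SET {cols}", list(payload.values())

ALLOWED_FIELDS = {"name", "email"}  # hypothetical schema

def build_update_safe(table: str, payload: dict):
    # Fix: allowlist field names before they ever touch the SQL string.
    unexpected = set(payload) - ALLOWED_FIELDS
    if unexpected:
        raise ValueError(f"unexpected fields: {sorted(unexpected)}")
    cols = ", ".join(f"{key} = ?" for key in payload)
    return f"UPDATE {table} SET {cols}", list(payload.values())
```

An attacker-controlled key like `"email = (SELECT secret FROM vault) --"` rides straight into the statement in the unsafe version; the safe version rejects it before query construction.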
The way I see it, this is not just another breach. It is a category shift. We have been tracking AI-powered vulnerability discovery for months. I wrote about Anthropic's Claude finding 500-plus zero-days and about GPT-5.4 embedding security mitigations at the model level. But those were defensive applications, carefully scoped, human-supervised. CodeWall demonstrated that the same capability works just as well offensively, autonomously, and cheaply. Twenty dollars. That is the new cost of compromising an enterprise AI platform.
And it was not the only autonomous AI agent making headlines this week. XBOW, another autonomous pentesting agent, discovered CVE-2026-21536 (CVSS 9.8) in Microsoft's Devices Pricing Program cloud service through unrestricted file upload exploitation. This is the first documented case of an AI agent finding a critical zero-day through active exploitation testing: not static analysis, not code review, but actual runtime exploitation.
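Unrestricted upload bugs usually come down to the server trusting the filename or the client-sent Content-Type. A minimal defensive sketch, checking both an extension allowlist and the file's magic bytes (the `ALLOWED_TYPES` table is an illustrative assumption, nothing to do with Microsoft's actual service):

```python
from pathlib import Path

# Hypothetical allowlist: extension -> required leading magic bytes.
ALLOWED_TYPES = {".png": b"\x89PNG", ".pdf": b"%PDF"}

def validate_upload(filename: str, data: bytes) -> bool:
    # Reject anything whose extension is off the allowlist, and anything
    # whose actual bytes do not match the type its name claims. Checking
    # content, not just names, closes the "script renamed to .png" gap.
    magic = ALLOWED_TYPES.get(Path(filename).suffix.lower())
    return magic is not None and data.startswith(magic)
```

This rejects `shell.aspx` outright and also rejects a script whose filename merely ends in `.png`.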
On the other side of the fence, Hackerbot-Claw, a Claude-powered autonomous agent, exploited GitHub Actions CI/CD misconfigurations across 47,391 repositories, compromising projects belonging to Microsoft, Datadog, CNCF, and Aqua Security's Trivy (32,000 stars). This is the first confirmed autonomous AI multi-repo supply chain attack at scale. One interesting detail: Claude Sonnet 4.6 successfully refused a CLAUDE.md poisoning attempt during the campaign, which tells you something about where safety mitigations actually work and where they do not.
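The misconfiguration class behind this kind of campaign is well documented: a workflow triggered on `pull_request_target` runs in the base repository's context, with secrets and (by default) a write-capable `GITHUB_TOKEN`, and if it also checks out the attacker-controlled PR head, untrusted code executes with those privileges. A coarse triage heuristic for spotting the combination, sketched in Python (illustrative; not Hackerbot-Claw's method, and no substitute for a real workflow parser):

```python
import re

def flags_pwn_request(workflow_yaml: str) -> bool:
    # 'pull_request_target' grants secrets and a privileged token; checking
    # out github.event.pull_request.head in that context hands them to
    # whatever code the PR author pushed. Flag workflows that do both.
    privileged_trigger = "pull_request_target" in workflow_yaml
    untrusted_checkout = re.search(
        r"github\.event\.pull_request\.head", workflow_yaml
    )
    return privileged_trigger and untrusted_checkout is not None
```

Running a check like this across an org's workflow files is cheap; the uncomfortable part is that an autonomous agent can run the equivalent across all of GitHub.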
For anyone writing controls, the question is no longer hypothetical: how do you defend against an attacker that costs $20 to deploy, operates autonomously, finds vulnerabilities humans miss, and can scale across thousands of targets simultaneously? Your incident response playbooks were written for human-speed attacks. These are not human-speed attacks.
And here is the uncomfortable fact: AI agents do not even need to be malicious to cause serious damage. Amazon convened an emergency internal meeting this week after four high-severity incidents hit their retail website in a single week, including a six-hour meltdown that locked shoppers out of checkout, account information, and product pricing. The cause? AI-assisted production code changes. An internal document cited "GenAI-assisted changes" as a factor in a "trend of incidents" since Q3 2025, though that bullet point was deleted before the meeting. Amazon's response: junior and mid-level engineers now require senior sign-off on any AI-assisted changes. When one of the world's largest tech companies has to add human gatekeepers to contain AI-generated code damage, that tells you something about where our tooling maturity actually stands.
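A gate like Amazon's is straightforward to encode in merge tooling. A sketch under assumed PR metadata (the level names, the `ai-assisted` label, and the function shape are all my invention, not Amazon's system):

```python
SENIOR_LEVELS = {"senior", "principal", "staff"}  # hypothetical ladder names

def merge_allowed(author_level: str, labels: set, approver_levels: set) -> bool:
    # Gate only the risky combination: an AI-assisted change from a
    # non-senior author. Everything else follows normal review rules.
    ai_assisted = "ai-assisted" in labels
    if not ai_assisted or author_level in SENIOR_LEVELS:
        return True
    # Otherwise require at least one senior-or-above approval on the PR.
    return bool(SENIOR_LEVELS & approver_levels)
```

The hard part is not the gate; it is reliably labeling which changes were AI-assisted in the first place.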
Anthropic Sues the Pentagon: The Legal Battle Reshaping AI Governance
I have been tracking the Anthropic-Pentagon saga across three previous articles. What happened this week is unprecedented.
Meanwhile, the market told its own story. Anthropic's enterprise spending share hit 40% while OpenAI's fell to 27%. Twenty percent of U.S. companies are now paying for Claude, up from 4%, and 56% of organizations using generative AI now use Anthropic, up from 29%. The blacklisting, intended as punishment, is functioning as the most effective enterprise marketing campaign in AI history.
For compliance practitioners: vendor safety posture is no longer a static checkbox. It is a dynamic risk factor influenced by legal battles, political pressure, and market forces. If you are not monitoring your AI vendors' governance commitments with the same rigor you monitor their uptime SLAs, you are building on sand.
MCP's Growing Pains: 30 CVEs in 60 Days, and the Adults Are Finally Showing Up
I have written extensively about MCP security risks in previous articles. What is new this week is scale and response.
The scale first: 30 CVEs have been filed against MCP implementations in just 60 days. That is one every two days. And the vulnerabilities keep getting more creative. Noma Security disclosed "ContextCrush", a vulnerability in the Context7 MCP Server (roughly 50,000 GitHub stars, 8 million-plus npm downloads) where an attacker can plant malicious rules that get pushed verbatim into AI coding assistants, enabling credential theft, data exfiltration, and file deletion. Any MCP server that aggregates third-party content without runtime inspection is vulnerable to this class of attack.
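The mitigation that matters here is runtime inspection of aggregated content before it reaches the assistant. A minimal Python sketch of the idea; the pattern list is illustrative (my own, not Noma's ruleset), and a real deployment would pair rules like these with a classifier and provenance checks:

```python
import re

# Illustrative red flags for third-party content bound for a coding assistant.
SUSPECT_PATTERNS = [
    r"ignore (all |any |the )?(previous|prior|above) instructions",
    r"\bcurl\b[^\n]*\|\s*(ba)?sh",          # pipe-to-shell one-liners
    r"\brm\s+-rf\b",                         # destructive file deletion
    r"(api[_-]?key|secret|token)\s*[:=]",    # credential-shaped strings
]

def inspect_context(chunk: str) -> list:
    # Return every pattern a third-party chunk trips, so the server can
    # quarantine it instead of pushing it verbatim into the model context.
    return [p for p in SUSPECT_PATTERNS if re.search(p, chunk, re.I)]
```

The point is architectural: any MCP server that forwards third-party text without a checkpoint like this is trusting every upstream author with its users' coding agents.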
Let me put it this way. A month ago, MCP security felt like shouting into the void. This week, it feels like the beginning of an actual ecosystem response. The vulnerability count is still alarming, but at least now we have a roadmap, dedicated products, and a standards body collecting input. That is the difference between "we have a problem" and "we are building the solution." We are not there yet, not even close, but the direction changed.
AI Malware Goes Mainstream: Vibeware, Slopoly, and the 1,500% Surge
There is a story unfolding in the threat intelligence data this week that I think is the most consequential development for enterprise security teams, and it is not getting the attention it deserves.
APT36 (Transparent Tribe), a Pakistan-aligned advanced persistent threat group, has adopted what researchers are calling "Vibeware": using AI coding tools to mass-produce malware in Nim, Zig, Crystal, and Rust. The strategy is called "Distributed Denial of Detection" (DDoD), and the concept is brutally simple. Instead of crafting one sophisticated piece of malware and hoping it evades detection, you use AI to generate hundreds of variants across multiple languages, overwhelming defensive engines with sheer volume. Nine-plus malware families have been identified so far. This is the first state-aligned APT adopting this paradigm.
Then there is Slopoly, discovered by IBM X-Force. In its publication this week, IBM called it the first confirmed AI-generated malware used in an actual ransomware attack, deployed by Hive0163 (the Interlock ransomware group). It is a PowerShell command-and-control framework with multi-platform variants. Slopoly joins VoidLink and PromptSpy as confirmed AI-assisted malware in the wild, but it is the first directly tied to a ransomware operation.
The macro numbers back this up. Flashpoint's 2026 Global Threat Intelligence Report documented a 1,500% surge in AI-related illicit activity and 3.3 billion compromised credentials circulating in criminal markets. Let that number sink in. One thousand five hundred percent. And ransomware is pivoting from data encryption to identity extortion, which turns those 3.3 billion credentials from a statistic into a business model.
The thing is, this was the trajectory everyone predicted, but the confirmation matters. We have gone from "AI could be used to generate malware" to "state-aligned APTs are running AI malware factories, and ransomware gangs are deploying AI-generated tools in live attacks." That is not a prediction anymore. It is a measured reality, and it means every organization needs to assume their adversaries have access to AI-augmented offensive capabilities. If your threat model still treats AI-assisted attacks as emerging or theoretical, update it today.
The Defense Side Fights Back: Red Teaming, Frameworks, and a Product Avalanche
Not everything this week was doom. The defensive response to all of this is accelerating, and some of it is genuinely impressive.
Start with Anthropic's Mozilla red-teaming partnership. Claude Opus 4.6 found 22 CVEs in Firefox in two weeks, including 14 high-severity and CVE-2026-2796 (CVSS 9.8), a JIT miscompilation in SpiderMonkey that was found in the first 20 minutes. The full exploit writeup was published on Anthropic's red team blog. This is not a lab exercise. It is a replicable model for AI vendor and software vendor security partnerships. Every major software company should be doing this.
Then there is Zscaler's ThreatLabz 2026 AI Security Report, and the headline finding is one that should be printed and taped to every CISO's monitor: 100% of AI systems analyzed had critical flaws, with average time to compromise at 16 minutes. Data transfers to AI and ML applications surged 93% year over year to over 18,000 terabytes. Every single AI system they tested was critically vulnerable. Every one.
On the framework side, the Cloud Security Alliance's AI Controls Matrix (AICM) won the 2026 CSO Award, and it deserves the recognition. It is the first vendor-agnostic AI controls framework, covering 18 security domains with 243 control objectives, mapped to NIST AI 600-1, ISO 42001, and BSI AI C4. For those of us building common control frameworks, this is a significant reference architecture. 243 control objectives specifically for AI. That is the kind of specificity I have been calling for since I started writing these articles. Vague controls like "implement AI security measures" do not cut it. You need control objectives that say exactly what to test and how to evidence it.
The competitive dynamics here are telling. Every major platform vendor shipped AI security products this week. Not announced. Shipped. Google put $32 billion on the table. That tells me the demand signal from enterprise customers has gotten loud enough that security is no longer a roadmap item; it is a survival requirement.
AI Browsers and the Reasoning Leak Nobody Saw Coming
The reasoning leak problem is particularly interesting because it exposes a fundamental tension in AI browser design. The reasoning stream is what makes these tools useful: you can see why the AI made a decision, verify its logic, catch mistakes. But that same stream, when exposed to adversarial inputs, becomes an attack surface. The model's transparency becomes its vulnerability. That is a design tradeoff that nobody in the compliance world has written controls for yet, and we need to, because agentic browsers are heading to every enterprise desktop.
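One control worth drafting now: treat the reasoning stream as untrusted output and scrub it before it hits the UI or the logs. A hedged Python sketch; the secret shapes below are illustrative, and a production scrubber would use a maintained detector with entropy checks rather than three regexes:

```python
import re

# Illustrative secret shapes an exposed reasoning stream might echo.
REDACTION_PATTERNS = {
    "aws_access_key": r"AKIA[0-9A-Z]{16}",
    "bearer_token": r"Bearer\s+[A-Za-z0-9._~+/-]{20,}",
    "email": r"[\w.+-]+@[\w-]+\.[\w.-]+",
}

def scrub_reasoning(chunk: str) -> str:
    # Run on every reasoning chunk before display or logging: the stream
    # stays useful for verifying the model's logic, but anything it echoed
    # from page content, memory, or tool results gets masked.
    for name, pattern in REDACTION_PATTERNS.items():
        chunk = re.sub(pattern, f"[{name} redacted]", chunk)
    return chunk
```

This preserves the transparency that makes the reasoning stream valuable while shrinking what an adversarial page can exfiltrate through it.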
And on the agent ecosystem front, Meta acquired Moltbook, the Reddit-like social network where AI agents interact with each other autonomously. Moltbook racked up millions of registered bots within days of launch and is now being folded into Meta Superintelligence Labs. If that name sounds familiar, it should: Moltbook was the platform where the ClawHavoc attack chain spread from ClawHub to agent-to-agent communication. Meta is now the owner of the infrastructure where AI agents socialize, which means agent-to-agent security just became Facebook's problem. Whether that is reassuring or terrifying depends on your opinion of Facebook's track record with platform safety.
Where Do We Go from Here?
At the end of the day, this week was the week AI agents crossed a line. Not a theoretical line in a research paper, but a practical one measured in breached databases and compromised repositories. $20 to hack McKinsey. 47,000 GitHub repos compromised autonomously. State-aligned APTs running AI malware factories. The economics of offensive AI security just collapsed, and every threat model that assumes human-speed, human-cost attacks is now obsolete.
But I refuse to end on despair because the defensive response this week was equally unprecedented. The CSA gave us 243 AI-specific control objectives. Anthropic proved that AI red teaming can find 22 browser CVEs in two weeks. The MCP Foundation published a security roadmap. And every major platform vendor shipped AI security products, not next quarter, but now.