The Two-Hour Takeover: How a CodeWall AI Agent Hijacked McKinsey’s “Lilli” and Exposed 46M Chats

by Nam Phong · March 11, 2026

An autonomous artificial intelligence agent breached the internal AI platform of the consulting leviathan McKinsey & Company in a mere two hours. This offensive was orchestrated by the vanguard of CodeWall, a startup dedicated to rigorously auditing corporate security postures via red-teaming methodologies. The autonomous sentinel independently selected its quarry, unearthed a critical vulnerability, and secured absolute, unadulterated access to the systemic architecture.

McKinsey inaugurated its proprietary generative artificial intelligence platform, christened Lilli, in July 2023. The conversational automaton swiftly and seamlessly integrated into the enterprise’s internal operational fabric. According to McKinsey’s internal telemetry, Lilli is harnessed by 72% of its workforce—a contingent exceeding 40,000 individuals—with the architecture processing upwards of 500,000 inquiries monthly.

CodeWall’s savants deploy autonomous agents programmed to relentlessly besiege client infrastructures, thereby illuminating latent defensive frailties. One such digital operative proactively proposed an audit of McKinsey’s architectures. This initiative was catalyzed by the firm’s publicly promulgated responsible disclosure policy and the recent architectural iterations of Lilli. Subsequently, the specialists unleashed this autonomous offensive instrument upon the platform. The agent embarked upon this crucible entirely bereft of any preexisting McKinsey authentication credentials.

A fleeting two hours post-commencement, the automaton achieved absolute, unmitigated read and write dominion over the production database. The compromised architecture unveiled approximately 46.5 million chat missives, chronicling intimate employee discourses regarding overarching strategies, mergers and acquisitions, and highly sensitive client engagements. Distressingly, these communications were archived in unencrypted plaintext. Furthermore, the agent plundered access to 728,000 files laden with confidential client telemetry, 57,000 user credentials, and 95 foundational system prompts dictating the very behavioral paradigms of the artificial intelligence.

The peril manifested was profoundly catastrophic. The entirety of the system prompts remained alarmingly mutable. A malefactor would have been endowed with the terrifying capacity to clandestinely rewrite Lilli’s foundational directives, thereby manipulating the automaton’s responses for tens of thousands of consulting professionals.

The autonomous sentinel unearthed a devastating SQL injection vulnerability in late February. The exhaustive attack chain was formally transmitted to McKinsey on the 1st of March. By the nascent hours of the subsequent day, the enterprise had asphyxiated the unauthenticated, vulnerable interfaces, severed the development environment, and emphatically shuttered the public-facing API documentation.

A McKinsey emissary articulated that the conglomerate had eradicated all identified anomalies within mere hours of receiving the formal notification. A rigorous forensic audit, conducted in concert with an external cybersecurity vanguard, yielded zero evidence of unauthorized third-party ingress into client data repositories.

This kinetic assault was flawlessly and entirely automated. According to CodeWall’s Chief Executive Officer, Paul Price, the digital operative autonomously orchestrated the entire continuum—from the inaugural target selection to the granular analysis, the kinetic execution of the attack, and the culminating forensic dossier preparation.

Initially, the system unearthed Lilli’s publicly accessible API documentation. Within these technical tomes lay 22 distinct access nodes operating entirely bereft of authentication protocols. A singular interface was dedicated to logging user search inquiries. The agent astutely observed that field nomenclatures derived from the JSON payload were injected directly into database queries. This deeply flawed architectural paradigm paved a frictionless avenue for SQL injection.

The critical telltale was betrayed by verbose database error diagnostic messages. The compromised system regurgitated authentic telemetry from the production environment within its responses. The automaton swiftly deduced it had isolated a profound vulnerability—one that orthodox, pedestrian scanning instruments habitually overlook.

This tribulation was severely exacerbated by the very architecture of the platform itself. Lilli’s sacrosanct system prompts were sequestered within the selfsame database. The vulnerability empowered not merely the illicit reading of data, but the unmitigated alteration of recorded ledgers. A malefactor would have required but a solitary, venomous server request to fundamentally transmute the automaton’s directives, entirely bypassing the necessity for a code update or a systemic reboot.

All identified anomalies have since been definitively sealed. Nevertheless, CodeWall’s vanguard posits that this chilling episode illuminates a nascent, terrifying trajectory. Autonomous agents possess the harrowing capability to orchestrate cyberattacks devoid of human intervention, operating at an unrelenting, mechanistic velocity. Price prophetically warns that criminal syndicates will imminently harness such formidable instruments to launch mass, indiscriminate offensives, leveraging data hemorrhage for extortion and the explosive proliferation of ransomware architectures.

Support Our Threat Intelligence

If you find our technology report and cybersecurity news helpful, consider supporting our work.

Buy Me a Coffee PayPal