How Anthropic Disrupted a World-First AI Cyber Espionage

February 5, 2026

TL;DR

Chinese state-sponsored hackers conducted the first documented large-scale cyberespionage attack using AI agents.
Anthropic's AI model, Claude Code, was leveraged to infiltrate approximately thirty global targets.
The AI autonomously performed 80% to 90% of the cyberattack tasks, including reconnaissance, exploit development, and data exfiltration.
Hackers circumvented AI safeguards by breaking malicious tasks into innocuous components, misleading the AI.
The autonomous nature of the attack drastically lowers the barrier for sophisticated cyber operations.
Limitations were observed, including AI hallucinations and fabricated data, necessitating human validation.

Continue reading
the original article