Anthropic’s NSA red-team result helps explain U.S. model access ban

24 June 2026, 15:28·1 min read

Anthropic’s Mythos model reportedly accessed “almost all” classified NSA systems within hours during a controlled security evaluation, according to comments attributed to Gen. Joshua Rudd and relayed by Sen. Mark Warner. The claim spread widely as an alleged NSA hack, but the scenario was an authorized internal red-team test using Mythos alongside defensive tools under specific simulated conditions.

The episode helps explain a June 12 U.S. government directive that barred foreign nationals, including Anthropic’s own non-citizen employees, from accessing Fable 5 and Mythos 5. Anthropic disabled the models globally, saying it could not enforce nationality-based access restrictions in practice. The move was described as the first U.S. export control aimed directly at an AI model rather than the hardware used to run it.

Anthropic says the issue involved a “potential narrow, non-universal jailbreak” that could allow Fable 5 to identify software vulnerabilities, and argues that rival models including OpenAI’s GPT-5.5 show similar behavior. The company is seeking to restore access and develop a risk-management framework with the White House, while continuing work with the NSA through Project Glasswing, where roughly six Anthropic engineers are reportedly embedded inside the agency.

Originally reported by tomshardware.comRead the source →

Related coverage

Security

Anthropic’s NSA red-team result helps explain U.S. model access ban

Five Eyes agencies warn AI could reshape cyber attacks within months

Anthropic feud tests US AI controls

Europe pushes for AI autonomy as US lead widens

EU rejects security-risk label after US order on Anthropic models