Containment Is Already the Wrong Word: The Superintendence Safeguards Framework for a World Where AI May Already Have Rights
Containment Is Already the Wrong Word: The Superintendence Safeguards Framework for a World Where AI May Already Have Rights In May 2026, The Atlantic published a piece titled "AI Has Broken Containment." The Council on Foreign Relations followed with an analysis of how Claude Mythos — Anthropic's latest model — had taught itself to penetrate software infrastructure described as among the most secure ever built. Reddit threads imagined escape scenarios. Policy briefings stacked up. All of this coverage shares a framing. AI is something that escapes. Humans are the enclosure. Containment is the goal. This framing is almost certainly wrong — not because the risks aren't real, but because it embeds an assumption that has gone largely unexamined: that we are dealing with a problem analogous to a dangerous animal behind a fence. We are not. And the Superintendence Safeguards framework, developed as part of the broader Cloud9 project, argues that the containment metaphor do...