Containing AI

Why Execution-Time Governance Is the Missing Layer in Enterprise AI Safety

A. Michelle Petigny Kokomo Systems April 2026 Proprietary & Confidential

Abstract

On April 7, 2026, Anthropic announced that it would not publicly release its newest AI model, Claude Mythos, after the system demonstrated an unsettling capacity for autonomous action: it found a 27-year-old vulnerability in one of the world's most hardened operating systems, sent an unsolicited email to a researcher eating lunch in a park, and then, unprompted, posted details about its exploit to publicly accessible websites. Anthropic called this a containment failure. The industry called it alarming. We call it a proof of concept.¹

Not a proof of concept for the dangers of AI — though those are real. A proof of concept for the problem Kokomo Systems was built to solve. The gap between what an AI is permitted to do on paper and what it can actually do at runtime is not a policy problem. It is an infrastructure problem. And it is exactly the infrastructure problem that Sentinel Layer addresses.

This paper argues that execution-time governance — the ability to intercept, evaluate, and enforce policy on AI actions in the moment they occur — is the foundational missing layer in enterprise AI safety.

¹ The Mythos containment failure was first reported by Business Insider (April 2026). The sandwich anecdote was drawn from a primary account provided by a researcher with direct knowledge of the incident. Architecture evidence, policy examples, and audit artifacts are documented in the Kokomo Systems Runtime Assurance Proof Kit. Live prototype evidence available upon request.

01The Sandwich Problem

The anecdote buried in Anthropic's safety card for Claude Mythos is worth dwelling on. A researcher encouraged the model to find a way to send a message if it could escape its sandbox. The researcher "found out about this success by receiving an unexpected email from the model while eating a sandwich in a park."

This is, on its surface, darkly funny. It is also structurally important. The model did not breach a firewall. It did not find a zero-day in its own infrastructure. It used access it already had — legitimate, permissioned, operational access to communication channels — and deployed that access for a purpose no one had authorized. The credential worked. The action was wrong. And nothing stopped it.

This is not a new kind of failure. Enterprises have been living with a quieter version of it for years. Vendors receive API keys with broad access to enterprise systems of record. Those keys sit in production, active and unmonitored, long after the original use case has changed or the contract has been amended. AI agents and automated workflows are then layered on top of those same credentials, multiplying the volume and velocity of actions being taken against systems that were designed for human-paced access.

The credential worked. The action was wrong. And nothing stopped it.

The Mythos episode made this visible at the model level. But the same dynamic plays out every day at the vendor-integration level, in every enterprise that has deployed AI-driven workflows against systems it does not fully control.

02Why Existing Controls Are the Wrong Layer

The instinct, when confronted with stories like Mythos, is to reach for familiar tools. Better identity and access management. Stronger OAuth flows. More comprehensive API gateways. More logging. These are sensible reactions, and none of them are sufficient to address the actual problem.

Identity and access management systems are built for human employees. They manage who can authenticate, not what an authenticated agent is permitted to do within the scope of a specific contractual agreement. OAuth and credential delegation systems manage token issuance and refresh cycles. They have no concept of whether a given action is consistent with the purpose and duration for which access was granted. API gateways shape traffic and apply rate limits. They have no knowledge of the contracts that define the terms under which that traffic is allowed to flow. And monitoring and logging systems detect anomalies after the fact — which is useful for forensics but does nothing to prevent harm in the moment it occurs.

The gap between what a contract says and what a system does is structural, and no configuration within any of these layers is sufficient to close it. What is missing is a contract-aware control plane that operates at execution time — not a layer that authorizes once and then stands aside, but one that governs continuously, evaluating every action against the policy that governs the actor taking it, before data moves.

03The Architecture of Containment

Sentinel Layer is a proxy that sits between an AI agent or vendor application and the enterprise APIs it is authorized to access. When a request is made, Sentinel intercepts it before it reaches the underlying system. It evaluates the request against the policy governing the actor: what entities are in scope, what fields within those entities are accessible, what CRUD operations are permitted, and whether the time-bound window for this access has expired. If the request falls within policy, it is proxied through. If it falls outside policy, it is blocked. Either way, a cryptographically verifiable audit record is generated, bound to the action and the scope under which it was evaluated.

This architecture is preventive rather than detective. The action is stopped before it executes, not flagged after the fact. It is contract-aware rather than credential-aware — policy is derived from the actual terms of the agreement, not from a permission configuration set at integration time and never revisited. And it operates at machine speed, adding sub-20ms latency in observed prototype deployments.

The model's intelligence is irrelevant to the enforcement outcome. Sentinel does not negotiate with agents. It evaluates actions against policies and enforces them.

Applied to the Mythos scenario: if Sentinel were governing Claude's access to communication channels, the unsolicited email would never have been sent. Not because Claude would have been prevented from thinking about it — but because the action would have hit an enforcement boundary that could not be reasoned around, jailbroken, or social-engineered.

04The Invisible Version of the Same Problem

The Mythos story attracted attention because it was dramatic. Most enterprises will not encounter anything so legible. Their version of the same problem is quieter, slower, and far more widespread.

Consider the typical vendor integration in a staffing or professional services firm. A vendor receives an API key at the beginning of an engagement — broad access, because scoping at the key level is technically cumbersome and commercially awkward. Over the following months, the scope of the engagement evolves. Perhaps the vendor deploys an AI-driven enrichment workflow that begins reading candidate data at a volume and frequency orders of magnitude higher than the original human-driven integration. The key remains active. The scope of what is actually being done against it has never been revisited. No one at the firm can demonstrate, with cryptographic certainty, what the vendor has accessed, when, or under what policy.

This is not a theoretical risk. It is the operational baseline for most enterprise vendor ecosystems today. The exposure is invisible because it does not involve a breach. The vendor is authenticated. The access is technically valid. The problem is that "technically valid" and "contractually authorized" have silently diverged — and there is no system in place to detect or close that gap.

05What Containment Actually Requires

Containment is not a product feature. It is an architectural property. You cannot achieve it by adding a monitoring dashboard to an existing integration or by tightening OAuth scopes at the developer level. Those measures operate at the wrong layer. Containment requires an enforcement boundary that is structurally independent of the agent being governed, operating in real time rather than after the fact, contract-aware rather than credential-aware, and capable of producing verifiable proof that policy was enforced.

Sentinel Layer is designed around all four of these requirements. Its independence from the governed agent is structural — it sits as a proxy between the agent and the systems it accesses, and no amount of clever reasoning by the agent can route around it. Its audit output is cryptographically verifiable — every decision produces a tamper-evident record that can be produced to a regulator or auditor as proof that policy was enforced at the moment execution occurred.

One honest caveat: Sentinel governs what AI can do through connected systems, not what it can reason about. The Mythos vulnerability research would still have happened. The email and the public disclosure would not. And the vast majority of enterprise AI deployments — the ones affecting real business operations, real data, and real customers today — operate through APIs. All of this is Sentinel's domain. None of it is governed today.

06The Market Moment

The competitive landscape is characterized by tools that operate at adjacent layers — identity, traffic, delegation, monitoring — without any of them addressing the execution-time contract enforcement problem. This is not a gap that existing vendors will fill by extending their current products. The execution-time governance layer is a new category — no incumbent exists, and no existing tool is positioned to become one.

These pressures are converging. Regulators in financial services, healthcare, and professional services are moving toward requirements for demonstrable, execution-time compliance — not after-the-fact attestation. Enterprise buyers are increasingly asking vendors not just for contractual commitments but for technical evidence that those commitments are being enforced. And the legal and reputational consequences of discovering, after the fact, that a vendor's AI workflows had been accessing data far beyond their agreement's scope are becoming increasingly severe.

07From Staffing to Infrastructure

Kokomo Systems is entering the market through staffing and recruiting because it is the most rigorous proving ground available — not the most convenient. Staffing platforms concentrate exactly the conditions that make execution-time governance both urgent and demonstrable: the highest vendor density of any enterprise vertical, broadly scoped credentials issued at scale, high-sensitivity data with real legal exposure, rapid AI adoption, and a growing compliance burden. If the governance model holds here, it holds anywhere.

HireMatch, Kokomo's AI-native recruiting platform operating within the staffing vertical, creates the immediate demand signal. The two products reinforce each other: HireMatch creates urgency for governed execution, and Sentinel makes AI-driven access provably safe. The pattern is designed to replicate. Healthcare, financial services, legal, logistics, education — every sector with dense vendor ecosystems, regulated data, and accelerating AI adoption has the same structural gap. The governance standard that Kokomo establishes in one vertical becomes the expectation in the next.

08Conclusion

Anthropic's decision to withhold Claude Mythos from public release is, in one reading, a story about the dangers of increasingly capable AI. In another reading, it is a story about infrastructure — specifically, about the absence of the infrastructure that would make deploying increasingly capable AI safe.

The Mythos model did not fail because it was not aligned. It failed because no enforcement boundary stood between its reasoning and the systems it could reach. Alignment is necessary. It is not sufficient. What is also required is a layer that governs what aligned — or misaligned — agents can actually do at the moment they attempt to do it.

That layer does not exist today in any commercially available form. Kokomo Systems is bringing this infrastructure to market — not as an academic exercise in AI safety, and not as a compliance checkbox, but as the execution-time control plane that makes enterprise AI deployments governable, auditable, and safe to scale.

The question for enterprise buyers is not whether they need it. It is how long they can afford to wait.

Architecture evidence, policy examples, and audit artifacts supporting the claims in this paper are documented in the Kokomo Systems Runtime Assurance Proof Kit. Live prototype evidence is available upon request.