Skip to content
Luisa Lima
Go back

A broken sandbox is worse than no sandbox, layer your defenses

Aonan Guan published a (second) complete bypass of Claude Code’s network sandbox. The outcome is that a process inside the sandbox can reach a host that Claude Code’s allowlist says to block, with the consequence that Claude Code can exfiltrate whatever it can touch.

First-principle learnings:

  1. never bet on a single trust boundary, especially if you don’t control it. Layer your defenses.
  2. a broken sandbox is even worse than no sandbox, if the user is counting on it for containment.

The vendor sandbox is only the inner ring of defense, it is not the “wall”. That wall needs to live somewhere the agent cannot control, such as a VM or container with strict egress networking rules enforced outside the agent’s influence.


Share this post:

Previous Post
Accuracy over agreement: my standing instructions for AI agents
Next Post
Slower loops, stronger signal