The agent control plane is the product
A production AI agent is only as good as the control plane around it: scope, identity, tool boundaries, evals, observability, approvals, and rollback. Better prompts do not replace that operating model.
Topic archive
16 essays tagged Agentsecurity. Practical notes on what happens after the demo: prompts, tools, review packets, evals, rollback, and production ownership.
A production AI agent is only as good as the control plane around it: scope, identity, tool boundaries, evals, observability, approvals, and rollback. Better prompts do not replace that operating model.
When an agent runs under your personal login, the audit trail cannot tell your actions from the agent’s. Give it a scoped, named identity so you can watch it, revoke it, and explain it.
The agent’s diff is the most expensive way to learn what it was planning to do. Run it once in shadow mode, where it produces a complete plan but executes nothing, and review the plan before you let it touch anything.
A Claude Code run that widens scope after a one-time approval is working under an authorization you never gave. A new permission should mean a stop, a request, and a logged human approval, not a footnote in the summary.
MCP servers are not harmless connectors once agents use them to reach tickets, data, APIs, deployment tools, or RAG systems. Treat them as part of the security boundary.
A production AI agent rollout fails when engineering proves the patch and security has to reconstruct the authority later. Run the delivery loop and the control loop together.
Claude Code teams need delivery evidence. Enterprise AI agent teams need authority evidence. One small control record can connect both without turning every agent run into a governance ceremony.
Claude Code and enterprise AI agents need more than permissions. Teams need explicit stop rules that tell the agent when to pause, collect evidence, and hand control back to a human.
AI agent platforms do not decide your risk appetite. Before teams wire Claude Code, MCP, RAG, workflow tools, and release automation into production work, they need a clear delegation policy.
Agent generated pull requests need more than a clean diff. Teams need a control record that captures scope, tools, tests, review evidence, rollback, and owner approval.