Thomas Talks AI
I write about production AI engineering: secure AI agents, Claude Code, LLM observability, evals, governance, financial-services controls, and the engineering work that starts after the demo works.
If you are trying to move AI from a clever local experiment into a team workflow, this site is about the part that decides whether it survives: permissions, review, observability, rollback, cost, and production discipline.
Books
If you want the longer version of the ideas here, these are the two current field guides. The blog starts below.


Start here
- From AI POC to production: the operating model that makes AI survive beyond the demo.
- LLM observability: why production AI needs replayable evidence, not only dashboards.
- AI agents in financial services: controls before autonomy.
- Agentic coding in production: the operating model around AI coding agents.
- AI coding agents: Claude Code, software agents, permissions, evals, and review loops.
- Claude Code production checklist: a practical checklist before agents touch serious repos.
- Enterprise AI Agents in Production: the two-book LeanPub bundle for building and securing production AI agents.
- Securing Enterprise AI Agents: the enterprise security guide for bounded autonomy, AgentSecOps, MCP security, RAG governance, and regulatory readiness.
- Claude Code book: the full field guide for production agents.
What I am trying to answer
Claude Code and other AI coding agents are already useful. The harder question is what happens when they meet real repositories, review habits, permissions, tests, and production risk.
The strongest posts so far:
- Claude Code is not the product. The production loop is.
- Claude Code agents need a flight recorder
- Claude Code permissions: the production mistake that bites later
- Claude Code evals should start with bad runs
