A Claude Code flight recorder diagram with task contract, tool calls, evidence, review, and rollback

Claude Code needs a flight recorder

Claude Code can produce a clean patch from a messy run. Production teams need a flight recorder: the task contract, tool calls, permission pressure, tests, assumptions, and rollback notes that explain how the patch was made.

May 23, 2026 · 5 min · 1005 words · Thomas De Vos
Read Claude Code needs a flight recorder
Diagram showing metric-only LLM observability versus a replayable production AI trace

LLM observability is not a dashboard. It is a replayable trail.

A latency chart will not explain why an AI answer was wrong. Production LLM systems need traces, sources, tool calls, prompt versions, eval results, and human decisions.

May 10, 2026 · 4 min · 812 words · Thomas De Vos
Read LLM observability is not a dashboard. It is a replayable trail.
A Claude Code review packet showing objective, permission boundary, tool trace, tests, cost, and rollback path before human approval

The Claude Code review packet I want before approving agent work

A Claude Code diff is not enough evidence for production review. Ask for the objective, permission boundary, tool trace, tests, failures, cost, and rollback path before approving agent work.

May 4, 2026 · 7 min · 1299 words · Thomas De Vos
Read The Claude Code review packet I want before approving agent work
Agent flight recorder diagram showing prompt, files changed, commands, tests, approvals, and rollback notes between a coding agent and deployment gate

Claude Code Agents Need a Flight Recorder

If a Claude Code agent changes production code, the useful artifact is not the chat transcript. It is a flight recorder: intent, boundaries, commands, diffs, tests, approvals, and rollback notes.

April 27, 2026 · 6 min · 1081 words · Thomas De Vos
Read Claude Code Agents Need a Flight Recorder