Writing

On building AI products that actually ship.

AI architecture, startup technical decisions, and what it takes to go from demo to 2am reliability.

June 19, 2026

aiagentsproduction

I Let AI Agents Open Pull Requests Against Production Code

An agent that turns a production error into a drafted, tested pull request — before anyone wakes up. What it took to trust it, and the guardrails that make it safe.

Read →

May 22, 2026

aievaluationproduction

How to Evaluate AI Output at Scale Without Reading 10,000 Responses

Reading more outputs isn't evaluation. How to build an LLM judge that routes human attention instead of trying to replace it.

Read →

May 22, 2026

aiarchitectureproduction

Why Your AI Agent Passes All Tests and Fails at 2am

The gap between demo reliability and production reality — and what to actually do about it.

Read →