Writing from the operating room.

What we've learned shipping AI agents to production at growth-stage companies. No thought-leadership. Original data, customer-signed numbers, and the boring infra wins behind the headlines.

May 18 · Eng · 14 min read

The verifier-executor split: why we ship two models, not one.

An independent verifier model trained on customer-corrected outputs catches 4× more bad outputs than self-consistency alone. The architecture, the eval, and what we got wrong twice.

Read article →

May 04 · GTM · 9 min read

Selling hours, not seats: a 2026 pricing autopsy.

What happens when you reprice your product against the labor it replaces. Six months of usage data, three painful contract resets, one obvious conclusion.

Read article →

Apr 22 · Eng · 20 min read

On-device PII redaction: building a privacy-safe observer.

The browser extension that captures cross-app workflows. Presidio + a custom classifier. Why we open-sourced the client and what threats we found.

Read article →

Apr 09 · Research · 7 min read

Eval-driven development for agents.

Golden datasets, regression nets, shadow runs in production. Why our PRs block on accuracy, not on test pass rate.

Read article →

Mar 27 · Field · 11 min read

What 47 design partners taught us about RevOps.

The patterns we saw, the ones we expected and didn't see, and the single workflow that pays for the year.

Read article →

Mar 14 · Brand · 6 min read

Why we look like a magazine, not a SaaS.

The visual system behind Routix. Editorial typography, Swiss grid, one signal color — and the deliberate absence of gradients and glow.

Read article →