Topic

AI Engineering

13 posts

AI Engineering
Ballast: An LLM App Whose Best Feature Is Saying 'I Don't Know'
June 27, 2026 · 6 min read
I built a self-healing RAG pipeline, a guardrails gateway, and an eval gate as one system, then threw 44 adversarial questions at it. Zero hallucinations, because the most important thing it does is refuse. Here is how trust got built in…
Read more →
AI Engineering
Building an AI-Native Platform: A Retrospective
June 26, 2026 · 13 min read
A year of building and operating a small fleet of finance and content products almost entirely through an AI coding agent. What worked, what was hard, the honest failures (including a flagship signal that measured nothing and an edge tha…
Read more →
AI Engineering
Prompt caching is a prefix match, not a flag
June 26, 2026 · 10 min read
Prompt caching looks like a flag you flip for a cheaper bill. It is really the reuse of a stored prompt prefix, governed by three rules, and applying it across four parts of my own system showed where it pays, where it quietly does nothi…
Read more →
AI Engineering
Hello Again, Opus
June 13, 2026 · 5 min read
Four days after I said goodbye to Opus, an export-control directive pulled Fable 5 offline and the fallback became the workhorse again. What I shipped in the window, what it cost, and the model-tiering plan for when Fable comes back.
Read more →
AI Engineering
Autonomy is mostly knowing when to stop
June 9, 2026 · 7 min read
I handed a backlog to Claude Fable, told it once it could merge, and let it run. It shipped seventeen items across five repos. The line that mattered was not in the work it finished. It was in the work it refused to touch.
Read more →
AI Engineering
Goodbye Opus, Hello Fable
June 9, 2026 · 3 min read
Anthropic shipped Claude Fable 5 and Mythos 5: same model, two names, one safeguard layer apart. What the new frontier model means for running agents in production.
Read more →
AI Engineering
Context architecture beats documentation dumps
June 8, 2026 · 11 min read
Dumping the whole corpus into an AI agent makes it worse, not better. The fix is architectural: each task loads a curated slice, not everything you have. Here is the method, and the same move at three different layers: specs, sensor data…
Read more →
AI Engineering
The Orange Pi That Maintains Itself
June 6, 2026 · 9 min read
A small ARM box that started as a local LLM experiment and ended up a self-governing node: private retrieval, a resident agent under a written constitution, a code-enforced safety fence, and a nightly job where it audits itself and files…
Read more →
AI Engineering
An orchestration mode is only as good as its backlog
May 31, 2026 · 4 min read
Anthropic published a guide on building a session-level orchestration mode. I built it two ways, on the CLI and on the API, and then hit the part the guide does not cover: an orchestrator that fans out is useless without a backlog of rea…
Read more →
AI Engineering
Wiring Garmin Into My Marathon Coach: A Live Data Integration Without an Official API
May 31, 2026 · 6 min read
How I replaced manual CSV exports with a live Garmin data feed for my AI marathon coach: a scheduled unofficial-API poller, resilient session handling, and the design calls that keep training and recovery data fresh and trustworthy.
Read more →
AI Engineering
SDD isn't about managing AI agents, it's about managing context
April 19, 2026 · 4 min read
Spec-driven development reads like a methodology for controlling AI agents. It isn't. It's a methodology for managing context across stateless sessions. The spec is the persistent memory.
Read more →
AI Engineering
Specs in, deploys out, no keyboard
April 18, 2026 · 3 min read
Two production sites, a blog, and two personal AI projects, shipped this week from a phone. The chain is voice dictation into Perplexity Computer, a spec, then Claude Code on the web. The interaction model is the story.
Read more →
AI Engineering
Building an AI Marathon Coach: Deterministic Rules, LLM Narratives, and the 2026 NYC Marathon
April 13, 2026 · 7 min read
How I built a personal AI coaching system for marathon training, layering deterministic guardrails over an LLM narrative engine, ingesting Garmin FIT files, and designing for my own injury history.
Read more →