Posts tagged AI

26 posts

Methodology
When your method repo and your product repo don't talk to each other
July 16, 2026 · 7 min read
I built a method as a public repo and the product that runs it as two private ones, and none of them treated the others as a source of truth. The domain enum lived in four places. A persona drifted between its lens file and its API contr…
Methodology
What the SDD Playbook Did Not Cover
July 12, 2026 · 4 min read
Three months ago I laid out spec-driven development and the folder architecture that makes it work. Most of the playbook held up in daily production use. Three ideas that essay never mentioned turned out to matter more than anything in i…
AI Engineering
Fable Thinks, Sonnet Builds
July 10, 2026 · 5 min read
I hit the Fable usage cap twice in under 48 hours and nearly ran out the total token limit. The plan that would have prevented it was published on this blog a month ago. Here is why it failed anyway, where the plan lives now, and what th…
Personal
I Keep My Whole Life in Spec Files. My Agent Reads Them and Never Writes Them.
July 10, 2026 · 6 min read
Spec-driven development, pointed at a life. Why my principles and goals live as markdown an AI agent reasons over, and the one rule that makes handing an agent your life safe: it can challenge the record, but it never writes it.
Finance
The Pocket Quant
July 10, 2026 · 4 min read
I built a quant research platform, then built an agent to operate it: a scheduled Claude session that reads the boards, keeps a pre-registered track record, and texts me three times a day without ever saying buy.
AI Engineering
Ballast: An LLM App Whose Best Feature Is Saying 'I Don't Know'
June 27, 2026 · 6 min read
I built a self-healing RAG pipeline, a guardrails gateway, and an eval gate as one system, then threw 44 adversarial questions at it. Zero hallucinations, because the most important thing it does is refuse. Here is how trust got built in…
AI Engineering
Building an AI-Native Platform: A Retrospective
June 26, 2026 · 13 min read
A year of building and operating a small fleet of finance and content products almost entirely through an AI coding agent. What worked, what was hard, the honest failures (including a flagship signal that measured nothing and an edge tha…
AI Engineering
Prompt caching is a prefix match, not a flag
June 26, 2026 · 10 min read
Prompt caching looks like a flag you flip for a cheaper bill. It is really the reuse of a stored prompt prefix, governed by three rules, and applying it across four parts of my own system showed where it pays, where it quietly does nothi…
Finance
Composite What You Trust, Watch What You Don't: A Trust Boundary for Data With Money Attached
June 17, 2026 · 10 min read
Every system that fuses signals into one consequential number has a fault line: the data you trust enough to composite into a grade versus the data you only trust enough to watch. How I drew that boundary in my personal finance engine, a…
AI Engineering
Hello Again, Opus
June 13, 2026 · 5 min read
Four days after I said goodbye to Opus, an export-control directive pulled Fable 5 offline and the fallback became the workhorse again. What I shipped in the window, what it cost, and the model-tiering plan for when Fable comes back.
Methodology
Ten days of June: the SDD velocity numbers, seven weeks in
June 10, 2026 · 6 min read
In April I published one week of SDD production numbers. The same data trail rerun for June 1 through 10 shows the velocity curve: 309 PRs opened, 293 merged, about 185 production deploys, and one footnote about outrunning GitHub Actions…
AI Engineering
Autonomy is mostly knowing when to stop
June 9, 2026 · 7 min read
I handed a backlog to Claude Fable, told it once it could merge, and let it run. It shipped seventeen items across five repos. The line that mattered was not in the work it finished. It was in the work it refused to touch.
AI Engineering
Goodbye Opus, Hello Fable
June 9, 2026 · 3 min read
Anthropic shipped Claude Fable 5 and Mythos 5: same model, two names, one safeguard layer apart. What the new frontier model means for running agents in production.
AI Engineering
Context architecture beats documentation dumps
June 8, 2026 · 11 min read
Dumping the whole corpus into an AI agent makes it worse, not better. The fix is architectural: each task loads a curated slice, not everything you have. Here is the method, and the same move at three different layers: specs, sensor data…
AI Engineering
The Orange Pi That Maintains Itself
June 6, 2026 · 9 min read
A small ARM box that started as a local LLM experiment and ended up a self-governing node: private retrieval, a resident agent under a written constitution, a code-enforced safety fence, and a nightly job where it audits itself and files…
AI Engineering
An orchestration mode is only as good as its backlog
May 31, 2026 · 4 min read
Anthropic published a guide on building a session-level orchestration mode. I built it two ways, on the CLI and on the API, and then hit the part the guide does not cover: an orchestrator that fans out is useless without a backlog of rea…
AI Engineering
Wiring Garmin Into My Marathon Coach: A Live Data Integration Without an Official API
May 31, 2026 · 6 min read
How I replaced manual CSV exports with a live Garmin data feed for my AI marathon coach: a scheduled unofficial-API poller, resilient session handling, and the design calls that keep training and recovery data fresh and trustworthy.
Finance
A Boring Design Let Me Run a Black Swan on a Tuesday
May 28, 2026 · 8 min read
Two posts ago I bet that keeping my portfolio reviewer's engine deterministic and auditable was worth it. This is where that bet paid off: because the engine is replayable, I could run a simulated market crash through the real production…
Finance
Building a Personal Finance Reviewer: What Survived the Rewrite
May 19, 2026 · 5 min read
A personal portfolio reviewer where the scoring is deterministic and the AI only narrates. The architecture that held up after I had to rewrite the model it was built on, and why that boundary is the whole point.
Finance
When the Spec Was Wrong: Rewriting a Shipped Decision
April 26, 2026 · 7 min read
Two weeks after I shipped a post about a scoring engine I'd built, I rewrote the spec it was based on. Here's what I learned, and why I had an AI agent do the literature review.
Methodology
One week of SDD in production: the numbers
April 20, 2026 · 2 min read
The previous two posts made claims. Here is what a week of the workflow looks like as a data trail: PRs, deploys, CI runs, and specs merged, all pulled from GitHub.
AI Engineering
SDD isn't about managing AI agents, it's about managing context
April 19, 2026 · 4 min read
Spec-driven development reads like a methodology for controlling AI agents. It isn't. It's a methodology for managing context across stateless sessions. The spec is the persistent memory.
AI Engineering
Specs in, deploys out, no keyboard
April 18, 2026 · 3 min read
Two production sites, a blog, and two personal AI projects, shipped this week from a phone. The chain is voice dictation into Perplexity Computer, a spec, then Claude Code on the web. The interaction model is the story.
AI Engineering
Building an AI Marathon Coach: Deterministic Rules, LLM Narratives, and the 2026 NYC Marathon
April 13, 2026 · 6 min read
How I built a personal AI coaching system for marathon training, layering deterministic guardrails over an LLM narrative engine, ingesting Garmin FIT files, and designing for my own injury history.
Methodology
Spec-Driven Development and the Folder Architecture That Makes It Work
April 9, 2026 · 12 min read
Why spec-driven development and structured folder architecture are the missing infrastructure for AI-assisted engineering: methodology, common mistakes, and where to start.
Methodology
Book Review: Enterprise Vibe Coding Playbook, Building Real Software with AI
April 8, 2026 · 8 min read
A practitioner's review of Doug Kerwin's Enterprise Vibe Coding Playbook, and why AI as a thinking partner, not a replacement, is the framework enterprise engineering teams need.