Blog | Eric Caskey

Goodbye Opus, Hello Fable

June 9, 2026 · 3 min read

Anthropic shipped Claude Fable 5 and Mythos 5: same model, two names, one safeguard layer apart. What the new frontier model means for running agents in production.

Read more →

Context architecture beats documentation dumps

June 8, 2026 · 11 min read

Dumping the whole corpus into an AI agent makes it worse, not better. The fix is architectural: each task loads a curated slice, not everything you have. Here is the method, and the same move at three different layers: specs, sensor data…

Read more →

The Orange Pi That Maintains Itself

June 6, 2026 · 9 min read

A small ARM box that started as a local LLM experiment and ended up a self-governing node: private retrieval, a resident agent under a written constitution, a code-enforced safety fence, and a nightly job where it audits itself and files…

Read more →

An orchestration mode is only as good as its backlog

May 31, 2026 · 4 min read

Anthropic published a guide on building a session-level orchestration mode. I built it two ways, on the CLI and on the API, and then hit the part the guide does not cover: an orchestrator that fans out is useless without a backlog of rea…

Read more →

Wiring Garmin Into My Marathon Coach: A Live Data Integration Without an Official API

May 31, 2026 · 6 min read

How I replaced manual CSV exports with a live Garmin data feed for my AI marathon coach: a scheduled unofficial-API poller, resilient session handling, and the design calls that keep training and recovery data fresh and trustworthy.

Read more →

A Boring Design Let Me Run a Black Swan on a Tuesday

May 28, 2026 · 8 min read

Two posts ago I bet that keeping my portfolio reviewer's engine deterministic and auditable was worth it. This is where that bet paid off: because the engine is replayable, I could run a simulated market crash through the real production…

Read more →

The caskeycoding.com tech stack at a glance

May 25, 2026 · 3 min read

A high-level tour of the technologies running this site: Next.js on CloudFront, Python Lambdas behind API Gateway, DynamoDB plus S3, Anthropic's API with a Bedrock fallback, and AWS CDK wiring it together.

Read more →

Building a Personal Finance Reviewer: What Survived the Rewrite

May 19, 2026 · 5 min read

A personal portfolio reviewer where the scoring is deterministic and the AI only narrates. The architecture that held up after I had to rewrite the model it was built on, and why that boundary is the whole point.

Read more →

When the Spec Was Wrong: Rewriting a Shipped Decision

April 26, 2026 · 7 min read

Two weeks after I shipped a post about a scoring engine I'd built, I rewrote the spec it was based on. Here's what I learned, and why I had an AI agent do the literature review.

Read more →

One week of SDD in production: the numbers

April 20, 2026 · 2 min read

The previous two posts made claims. Here is what a week of the workflow looks like as a data trail, PRs, deploys, CI runs, specs merged, pulled from GitHub.

Read more →

SDD isn't about managing AI agents, it's about managing context

April 19, 2026 · 4 min read

Spec-driven development reads like a methodology for controlling AI agents. It isn't. It's a methodology for managing context across stateless sessions. The spec is the persistent memory.

Read more →

Specs in, deploys out, no keyboard

April 18, 2026 · 3 min read

Two production sites, a blog, and two personal AI projects, shipped this week from a phone. The chain is voice dictation into Perplexity Computer, a spec, then Claude Code on the web. The interaction model is the story.

Read more →

Building an AI Marathon Coach: Deterministic Rules, LLM Narratives, and the 2026 NYC Marathon

April 13, 2026 · 7 min read

How I built a personal AI coaching system for marathon training, layering deterministic guardrails over an LLM narrative engine, ingesting Garmin FIT files, and designing for my own injury history.

Read more →

Designing Safety Guardrails for Distributed Workflow Orchestration

April 10, 2026 · 6 min read

Patterns for pre-execution safety checks, parallel validation, opt-out design, and extensible guardrail architecture on workflow platforms.

Read more →

Spec-Driven Development and the Folder Architecture That Makes It Work

June 20, 2025 · 12 min read

Why spec-driven development and structured folder architecture are the missing infrastructure for AI-assisted engineering: methodology, common mistakes, and where to start.

Read more →

Book Review: Enterprise Vibe Coding Playbook, Building Real Software with AI

June 10, 2025 · 8 min read

A practitioner's review of Doug Kerwin's Enterprise Vibe Coding Playbook, why AI as a thinking partner, not a replacement, is the framework enterprise engineering teams need.

Read more →

Welcome: Building Platforms for Scale

May 15, 2025 · 2 min read

An introduction to the blog, reflections on infrastructure monitoring, platform leadership, and building systems that empower organizations to innovate safely at scale.

Read more →