Caskey Engineering

Eric Caskey

Platform Primitives for Engineering Organizations

I build platform primitives at Amazon within an organization whose mandate is to be a force multiplier for service teams. I own the monitoring platform that standardizes infrastructure monitoring across the Amazon fleet, and built a workflow orchestration control plane now supporting millions of executions across development teams in Dublin, Seattle, San Jose, and New York. That platform is on a path to General Availability for all of Amazon, enabling engineers to build safety-adherent workflows that meet reliability requirements.

Eric Caskey

What I Build

Safety Systems

Platform-enforced safety for automated infrastructure operations. Teams build workflows; the platform ensures they meet reliability standards before anything executes.

Platform Monitoring

Infrastructure observability standardized at enterprise scale — 400,000+ monitors at Prudential, and monitoring standards adopted across thousands of services at Amazon.

AI-Augmented Engineering

A spec architecture where AI agents inherit curated context rather than full codebases — the methodology behind caskeycoding.com, applied to production platform infrastructure.

View Case Studies →

Tools

Latest post

One week of SDD in production: the numbers

April 20, 2026 · 2 min read

One week of SDD in production: the numbers The previous two posts made claims. This one is the data trail, pulled from the GitHub API across five repositories for the week of April 14–21, 2026. Same workflow in every

Read more →