Writings and Learnings

Essays on AI agents, full-stack systems, and building in public.

2026-05-23
GitHub Spec-Kit: Executable Blueprints for SDD

Spec-Driven Development turns documentation into executable blueprints for AI agents.

spec-kitgithubSDDai-agents
2026-05-23
Effective Harnesses for Long-Running Agents

Overcoming the Long-Running Agent Problem with a Two-Fold System and structured state management.

anthropicagentslong-runningllm
2026-05-23
Beyond pass@1: A Reliability Science Framework for AI

Transitioning from simple performance metrics to a robust Reliability Science framework for evaluating long-horizon LLM agents.

arxivllmbenchmarkingreliability
2026-05-23
Effective Context Engineering for AI Agents

Context Engineering is the art of curating the optimal set of tokens to maximize an agent's attention budget.

anthropiccontextllmprompt-engineering
2026-05-20
NRQL Predictions: Forecasting Future Trends in New Relic

How NRQL predictions use Holt-Winters exponential smoothing to forecast time series metrics.

observabilityNRQLMLforecasting
2026-05-15
How I Reduced Claude API Cost by 99% with Prompt Chaining

From $1.00 to $0.01 per slide — redesigning an AI presentation pipeline with structured outputs and prompt chaining.

AILLMClaudeoptimizationprompt-engineering
2026-05-10
Building Real-Time Systems: WebSockets, Redis, and Sub-50ms Latency

Architecture decisions behind a real-time chess platform with sub-50ms move propagation and ELO matchmaking.

websocketsredisreal-timesystemstypescript