Recent Articles

85 posts · sorted by date

July 26, 2026 9 min · read

5/5 - Four Layers, One Agent System: Choosing the Right Engineering Lever

A comparative field guide to prompt, context, harness, and loop engineering, including the unit of work, failure signal, owner, and practical debugging order for each layer.

July 25, 2026 8 min · read

4/5 - When Done Actually Means Done: Loop Engineering for Long-Running Agents

A careful guide to loop engineering: goals, progress, iteration budgets, no-progress detection, verification, pause and resume, and the controls that keep autonomous runs finite.

July 24, 2026 9 min · read

3/5 - The Agent's Operating System: Harness Engineering Without the Hype

A production guide to the software around an agent model: tools, permissions, state, retries, idempotency, verification, observability, and the boundaries that make action safe.

July 23, 2026 8 min · read

2/5 - The Working Set, Not the Window: Context Engineering for Agents

A detailed guide to context engineering: selecting, ranking, compressing, and expiring the information an agent needs on each turn without drowning the model in a giant context window.

July 22, 2026 9 min · read

1/5 - The Message That Survives Reality: Prompt Engineering for Working Systems

A practical, production-minded guide to prompt engineering: composing instructions, examples, constraints, schemas, and tests that remain reliable when models, users, and tools vary.

July 21, 2026 10 min · read

21/21 - Below PyTorch: Profiling, Compilation, and CUDA Kernel Optimization

A production-focused guide to below pytorch: profiling, compilation, and cuda kernel optimization, with architecture, capacity math, failure analysis, and operational controls.

July 20, 2026 10 min · read

20/21 - The Ground Beneath AI: Linux, Networking, and Storage

A production-focused guide to the ground beneath ai: linux, networking, and storage, with architecture, capacity math, failure analysis, and operational controls.

July 19, 2026 10 min · read

19/21 - Shipping Models Like Software: CI/CD, MLflow, and Registries

A production-focused guide to shipping models like software: ci/cd, mlflow, and registries, with architecture, capacity math, failure analysis, and operational controls.

July 18, 2026 10 min · read

18/21 - Assume the Prompt Is Hostile: Security and Guardrails

A production-focused guide to assume the prompt is hostile: security and guardrails, with architecture, capacity math, failure analysis, and operational controls.

July 17, 2026 10 min · read

17/21 - From Kafka to Tokens: Streaming Data and Online Inference

A production-focused guide to from kafka to tokens: streaming data and online inference, with architecture, capacity math, failure analysis, and operational controls.

Next 2/9