Database, Cloud, and AI Engineering Notes from Production Systems

Practical architecture reviews, failure-mode analysis, and operating models for teams building database-backed systems, cloud platforms, and AI-assisted engineering workflows.

Start Here

New to the site? These posts are ordered as a reading path, while Latest Notes below is ordered by publication date.

  1. AI Token Cost Overruns: Why AI Coding Assistants Are Becoming the New Cloud Bill Problem AI Engineering · L2 Deep Dive
  2. Harness Engineering: The 2026 Breakthrough Concept AI Engineering · L1 Field Note
  3. Agent Productivity Depends on Context Throughput AI Engineering · L2 Deep Dive
  4. Database Runbooks as Agent Contracts Databases · L1 Field Note
  5. Cloud Architecture Review Checklist for Database-Backed Applications Databases · L3 Reference Guide
  6. Terraform in CI/CD: Plan, Review, Apply, Lock, and Rollback Boundaries Cloud & Platform · L2 Deep Dive

Latest Notes

Recent field notes and breakdowns across AI engineering, databases, cloud, and system design.

Topics

Browse by the production problems the notes are written around.

69 posts

AI Engineering

Agents, context engineering, harness design, MCP, evaluation, token efficiency, and AI-assisted engineering workflows.

  • AI Token Cost Is the New Cloud Bill
  • Build vs Buy: The AI Platform Architecture Decision
  • AI Governance for Engineering Teams: Preventing Shadow AI Spend Without Blocking Innovation
102 posts

Databases

PostgreSQL, Aurora, MySQL, Oracle, Cassandra, MongoDB, pgvector, replication, migrations, indexing, and database operations.

  • Datadog DBM: What Database Teams Should Actually Monitor
  • Why Database Engineers Should Care About AI Cost Engineering
  • How to Run a Database Cost & Reliability Review
86 posts

Cloud & Platform

AWS, Azure, GCP, OCI, Terraform, Kubernetes, CI/CD, Cloudflare, developer platforms, and operational control planes.

  • The Math Behind Database Reserved Instances: When to Wait
  • BigQuery Cost Optimization: On-Demand vs Slot Commitments
  • Database Licensing Cost Across AWS, Azure, GCP, and OCI
50 posts

System Design

Architecture reviews, scalability, failure modes, guardrails, distributed systems, reliability boundaries, and production tradeoffs.

  • Why Your Non-Prod Databases Cost as Much as Production
  • 330 Redundant Data Centers All Failed Simultaneously — Because They Were Identical
  • The End of Single-Signal Alerting: Correlating Metrics, Logs, Traces, Deployments, and Cost
24 posts

Engineering Fundamentals

Core engineering principles, debugging workflows, observability, performance basics, reviews, and practical operating habits.

  • AI Cost Observability Dashboard: LangSmith vs Helicone
  • Alert Fatigue Engineering: How to Build Fewer, Better, Actionable Alerts
  • Cost Observability: Build Dashboards That Show Waste Before Finance Finds It
73 posts

Field Notes

Short practical observations, checklists, production lessons, debugging notes, and decision patterns from real engineering work.

  • Datadog DBM: What Database Teams Should Actually Monitor
  • AI Token Cost Is the New Cloud Bill
  • Why Database Engineers Should Care About AI Cost Engineering

Series

Multi-post arcs that connect practical decisions across a topic.