#architecture

206 posts

Jun 5, 2026 11 min read

L3 Reference Guide

Build vs Buy: The AI Platform Architecture Decision

Evaluating the architectural tradeoffs between turnkey AI coding tools and building an internal AI gateway — with design options, failure modes, and implementation guidance.

#ai-engineering #architecture #cloud

Jun 2, 2026 6 min read

L2 Deep Dive

AI Engineering

AI Governance for Engineering Teams: Preventing Shadow AI Spend Without Blocking Innovation

How to govern LLM API spend using centralized gateways without slowing down developer velocity, drawing on established cloud cost control patterns.

#ai-engineering #cloud #architecture #failures

May 31, 2026 6 min read

L2 Deep Dive

AI Engineering

AI Token Cost Overruns: Why AI Coding Assistants Are Becoming the New Cloud Bill Problem

Why AI coding assistant spend needs cloud-style FinOps controls before agent loops, context growth, and workspace credits become a surprise bill.

#ai-engineering #cloud #architecture

May 29, 2026 7 min read

L2 Deep Dive

AI Engineering

Agent Productivity Depends on Context Throughput

AI coding agents work better when voice, clipboard, screenshots, and MCP tools reduce context friction.

#ai-engineering #architecture #checklist

May 28, 2026 17 min read

L3 Reference Guide

Databases

Per-App Postgres on Kubernetes Changes the Failure Boundary

How CloudNativePG, GitOps, and external secrets make per-application Postgres viable without hiding the operational cost.

#databases #cloud #architecture

May 27, 2026 7 min read

L2 Deep Dive

AI Engineering

AI Cost Incident Runbook: What to Do When Monthly Token Spend Suddenly Doubles

An operational playbook for triaging and containing LLM token spend spikes — from alert fire to root cause within 30 minutes.

#ai-engineering #failures #architecture #checklist

May 25, 2026 6 min read

L2 Deep Dive

Databases

Azure Database for PostgreSQL: Flexible Server vs Hyperscale (Citus) Architecture Decision

When to choose Azure Flexible Server vs Citus for PostgreSQL on Azure — failover behavior, connection pooling, and the workload shapes where each architecture wins and breaks.

#databases #cloud #architecture

May 25, 2026 7 min read

L2 Deep Dive

Databases

Cassandra Write Path Fundamentals for Database Engineers

How Cassandra's commit log, Memtable, and SSTable pipeline works, why write amplification is the dominant operational cost, and how compaction strategy selection changes it.

#databases #architecture

May 25, 2026 6 min read

L2 Deep Dive

Databases

GCP AlloyDB vs Cloud SQL for PostgreSQL: When to Upgrade

When Cloud SQL's managed PostgreSQL hits its limits and AlloyDB's columnar cache and HTAP architecture become worth the migration complexity and cost jump.

#databases #cloud #architecture

May 24, 2026 9 min read

L2 Deep Dive

Databases

The Stack for AI-Accelerated Database Operations Is Now Open Source

Three May 2026 breakout projects close the gaps that stop database teams from moving schema changes, query assistance, and operational workflows to AI: declarative Postgres migrations, local LLM inference, and a full agent platform.

#databases #ai-engineering #architecture

May 16, 2026 6 min read

L2 Deep Dive

Databases

Stop Writing Ad-Hoc Queries: Build a Skill Backbone for Your DB Engineering Workflows

How to codify repetitive DB tasks into testable, reusable Claude skills that produce consistent SQL, runbooks, and migration outputs instead of one-off chat prompts.

#ai-engineering #databases #architecture

May 12, 2026 7 min read

L2 Deep Dive

AI Engineering

Agentic SRE Architecture: Skills, Agents, MCP Servers, and Human Approval Loops

The definitive 2026 reference architecture for autonomous database operations, from detection to multi-agent diagnosis to human-in-the-loop remediation.

#ai-engineering #architecture #system-design #cloud

May 8, 2026 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: April 2026 — Part I

The highest-starred new open-source projects in April 2026 relevant to database engineering, infrastructure, and AI tooling — focused on eliminating manual context re-injection across system design, platform automation, and AI memory.

#ai-engineering #databases #architecture

May 6, 2026 6 min read

L2 Deep Dive

AI Engineering

Prompt Caching, Context Pruning, and Model Routing: Practical Ways to Reduce LLM Cost

How to combine semantic routing, structured context pruning, and prompt caching to reduce production LLM API costs without degrading application quality.

#ai-engineering #architecture #cloud

Apr 29, 2026 4 min read

L1 Field Note

AI Engineering

AI Coding Assistant ROI: When $200/Developer/Month Is Cheap — and When It Is Waste

Why treating AI assistant seats like standard SaaS licenses obscures their true infrastructure cost profile, and how to measure ROI using cloud compute parallels.

#ai-engineering #cloud #architecture #failures

Apr 22, 2026 7 min read

L2 Deep Dive

Databases

Top GitHub Breakouts: March 2026 — Agent Adaptation and Production-Scale Vector Search

The second wave of March 2026 breakouts: an agent that learns from every conversation, a Rust vector index that outperforms FAISS at a fraction of the memory, and a Kubernetes-native agent control plane.

#ai-engineering #databases #architecture

Apr 22, 2026 4 min read

L1 Field Note

AI Engineering

Token Budgeting for Engineering Teams: Daily, Weekly, Monthly Controls by Developer and Repository

How to implement token quotas, chargebacks, and spend controls for AI engineering teams, drawing parallels from cloud database cost management.

#cloud #ai-engineering #architecture

Apr 15, 2026 5 min read

L1 Field Note

Engineering Fundamentals

AI Cost Observability Dashboard: LangSmith vs Helicone

How to build an AI FinOps dashboard and choose between proxy-based and instrumentation-based observability.

#ai-engineering #architecture #checklist

Apr 15, 2026 14 min read

L3 Reference Guide

AI Engineering

GitHub Breakouts: Q1 2026 — The Quarter's Top Productivity Shifts

Six open-source projects from Q1 2026 that converged on eliminating the manual scaffolding between AI agents and production infrastructure: context management, local cloud testing, and vector retrieval.

#ai-engineering #architecture #databases #cloud

Apr 11, 2026 6 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: March 2026 — Part I

Three components AI teams still build by hand — task decomposition graphs, persistent agent workspaces, and path-scored retrieval — each got a breakout open-source release in March 2026 that replaces custom wiring with library calls.

#ai-engineering #architecture

Apr 8, 2026 2 min read

L1 Field Note

System Design

Why Your Non-Prod Databases Cost as Much as Production

Architectural strategies to eliminate waste in Dev, Test, and Staging database environments.

#failures #architecture

Apr 8, 2026 4 min read

L1 Field Note

AI Engineering

Why Agentic AI Costs Explode: Context Size, Tool Calls, MCP Servers, Repo Size, and Retry Loops

Agentic AI systems can quietly accumulate massive API bills due to compounding context windows, retry loops, and unconstrained workspace parsing.

#ai-engineering #architecture #cloud #failures

Apr 1, 2026 2 min read

L1 Field Note

Cloud & Platform

The Math Behind Database Reserved Instances: When to Wait

Why committing to 3-year database reserved instances too early locks in architectural waste.

#cloud #architecture

Mar 25, 2026 5 min read

L2 Deep Dive

AI Engineering

Claude Code Cost Management for Engineering Teams

A deep dive into model routing rules, context pruning with Graphify, and governing agent API spend.

#ai-engineering #architecture

Mar 22, 2026 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: February 2026 — Local Agents and MCP Bridges

February 2026's highest-starred new open-source projects connecting AI agents to local infrastructure, Kubernetes clusters, and structured data without cloud API dependencies.

#ai-engineering #cloud #architecture

Mar 18, 2026 2 min read

L1 Field Note

Cloud & Platform

BigQuery Cost Optimization: On-Demand vs Slot Commitments

How to stop runaway BigQuery costs by analyzing query scans, enforcing partitions, and moving to capacity-based pricing.

#cloud #architecture #checklist

Mar 18, 2026 3 min read

L1 Field Note

AI Engineering

The New AI FinOps Model: Seat Cost vs Token Cost vs Agent Runtime Cost

Why traditional SaaS spend models fail for agentic AI, and how platform teams are treating LLM compute like database provisioned IOPS.

#ai-engineering #cloud #architecture #failures

Mar 14, 2026 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: February 2026 — Part II

The highest-starred new open-source projects in February 2026 — agent-native LLM routing, free AWS local emulation, and cross-platform semantic memory for AI coding agents.

#ai-engineering #cloud #architecture

Mar 11, 2026 2 min read

L1 Field Note

Databases

Oracle to Aurora PostgreSQL: License Cost Elimination in Practice

The engineering reality and ROI of migrating from Oracle to Amazon Aurora PostgreSQL.

#databases #cloud #architecture

Mar 10, 2026 8 min read

L2 Deep Dive

AI Engineering

MCP Server Observability: The New Control Plane for AI + Enterprise Tools

How the Model Context Protocol (MCP) became the networking layer for AI agents, and why monitoring these connections is critical for enterprise security.

#ai-engineering #architecture #system-design #security

Mar 7, 2026 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: February 2026 — Part I

The highest-starred new open-source projects in February 2026 — eliminating the context tax that slows AI-assisted code review, infrastructure generation, and database operations.

#ai-engineering #databases #architecture

Feb 27, 2026 4 min read

L1 Field Note

AI Engineering

Context Anxiety and Harness Decay

Why agent harnesses become stale when they overfit today's model weaknesses instead of stable execution contracts.

#ai-engineering #architecture #failures

Feb 25, 2026 2 min read

L2 Deep Dive

Databases

Azure Hybrid Benefit for SQL Server: The Exact Math

A deep dive into the cost savings and mechanics of applying Azure Hybrid Benefit to SQL Server deployments.

#databases #cloud #architecture

Feb 24, 2026 4 min read

L1 Field Note

AI Engineering

Programmatic Tool Calling for DB Automation

A reference pattern for keeping large database outputs out of model context by using scripts that summarize evidence before the agent sees it.

#databases #ai-engineering #architecture

Feb 20, 2026 4 min read

L1 Field Note

AI Engineering

Tool Search vs Loading Every MCP Tool

Why production agents need discoverable tools and context budgets instead of one giant always-loaded MCP surface.

#ai-engineering #architecture #cloud

Feb 18, 2026 2 min read

L1 Field Note

Databases

Azure Synapse Cost Optimization: DWU Right-Sizing, Serverless, and Hybrid Benefit

How to reduce your Azure Synapse compute bill by right-sizing dedicated pools and offloading to serverless.

#databases #cloud #architecture

Feb 17, 2026 4 min read

L1 Field Note

AI Engineering

Token-Efficient Tool Use

How to design agent tool surfaces that preserve context budget for reasoning instead of wasting it on tool metadata and raw output.

#ai-engineering #architecture

Feb 13, 2026 4 min read

L1 Field Note

AI Engineering

Application Legibility for Agents

A reference architecture for making logs, metrics, test output, schemas, and deployment history readable by coding agents.

#ai-engineering #architecture #cloud

Feb 11, 2026 2 min read

L1 Field Note

Cloud & Platform

Database Licensing Cost Across AWS, Azure, GCP, and OCI

A framework for managing commercial database licensing costs across the four major cloud providers.

#databases #cloud #architecture

Feb 6, 2026 4 min read

L1 Field Note

AI Engineering

Agent-to-Agent Review Loops

A practical review pattern where one agent creates a change and specialized agents review risk, rollback, security, and observability.

#ai-engineering #architecture #checklist

Feb 4, 2026 3 min read

L1 Field Note

Cloud & Platform

Cloud Database Cost Engineering: How to Reduce Database, Data Warehouse, and Licensing Spend Across Azure, AWS, GCP, and OCI

A comprehensive framework for reigning in cloud database costs, focusing on licensing, right-sizing, and architectural tradeoffs.

#databases #cloud #architecture #checklist

Feb 3, 2026 4 min read

L1 Field Note

AI Engineering

Harness Engineering: The 2026 Breakthrough Concept

Why the real engineering surface around agents is the harness of tools, scripts, context, review, and telemetry.

#ai-engineering #architecture

Jan 30, 2026 4 min read

L1 Field Note

Databases

Database Runbooks as Agent Contracts

A reference operating model for turning human database runbooks into machine-usable agent contracts.

#databases #ai-engineering #architecture #checklist

Jan 28, 2026 16 min read

L3 Reference Guide

AI Engineering

GitHub Year in Review: 2025 — What Open Source Changed in the Engineering Stack

Nine breakout repos across four themes — MCP protocol adoption, agent memory infrastructure, AI-native platform ops, and database automation — that eliminated the hand-built glue code between AI agents and production systems.

#ai-engineering #architecture #databases #cloud

Jan 27, 2026 4 min read

L1 Field Note

AI Engineering

The New Engineer Role: Implementer to Orchestrator

Why agentic coding shifts senior engineering work toward decomposition, verification, and operating-model design.

#ai-engineering #architecture

Jan 23, 2026 4 min read

L1 Field Note

Databases

Repo-Embedded Skills for Database Teams

Why database teams should store agent instructions, runbook contracts, and review policies in the repository instead of in memory.

#ai-engineering #databases #architecture

Jan 20, 2026 4 min read

L1 Field Note

Databases

Agentic Code Review for Database Repositories

Database repositories contain hidden rules human reviewers know: never add a blocking index at peak hours, never widen IAM without owner approval. Agent review surfaces these violations before merge — without displacing the human judgment that set the rules.

#ai-engineering #databases #architecture

Jan 20, 2026 8 min read

L2 Deep Dive

AI Engineering

AI Agent Observability: Monitor Tool Calls, Token Spend, Latency, and Failure Loops

Why monitoring autonomous SRE agents requires tracking tool-call hallucinations, context window saturation, and recursive retry loops, rather than just basic CPU metrics.

#ai-engineering #architecture #failures #system-design

Jan 16, 2026 4 min read

L1 Field Note

AI Engineering

Agent Autonomy Ladder: Manual, Confirm, Auto-Approve, Supervised

A governance model for deciding which database and cloud agent actions require approval and which can run automatically.

#ai-engineering #architecture #checklist

Jan 15, 2026 14 min read

L3 Reference Guide

AI Engineering

GitHub Breakouts: Q4 2025 — The Quarter's Top Productivity Shifts

Six open-source projects that collectively delivered the missing infrastructure layer for production AI agents: secure sandboxes, deployment platforms, persistent memory, token-efficient encoding, and AI-native storage.

#ai-engineering #architecture #databases #cloud

Jan 12, 2026 4 min read

L1 Field Note

AI Engineering

Outcome-Based Agent Evaluation vs Transcript Review

A field note on why agent evaluation should measure verified state changes instead of polished reasoning traces.

#ai-engineering #architecture

Jan 9, 2026 5 min read

L1 Field Note

AI Engineering

Evals Are the New Unit Tests for Agents

Why database and cloud teams need agent eval harnesses that grade outcomes, not persuasive transcripts.

#ai-engineering #architecture #checklist

Jan 5, 2026 6 min read

L2 Deep Dive

AI Engineering

Agent Loop Anatomy for DB and Cloud Engineers

A practical mental model for how coding agents plan, call tools, observe results, and complete infrastructure work without treating the model response as the whole system.

#ai-engineering #architecture #databases #cloud

Dec 20, 2025 8 min read

L2 Deep Dive

Databases

Automated Reliability Across the Stack: Database Backups, Platform Observability, and SQL Quality (November 2025)

Three November 2025 open-source releases eliminate manual work from three engineering reliability tasks — multi-database backup verification, self-hosted log and trace collection, and SQL static analysis in CI pipelines.

#databases #ai-engineering #architecture

Dec 16, 2025 8 min read

L2 Deep Dive

Cloud & Platform

The 2026 Automation Roadmap for SRE, DevOps, and Database Teams

The 2026 automation priorities for SRE, DevOps, and database teams: what to finish, what to stop maintaining manually, and where agent workflows are actually production-ready.

#architecture #cloud #checklist

Dec 9, 2025 6 min read

L2 Deep Dive

AI Engineering

Telemetry Cost Control: Why Observability Data Itself Needs Governance

If you log everything and monitor every dimension, your observability bill will eventually exceed your database infrastructure bill. Here is how to fix it.

#cloud #architecture #ai-engineering

Dec 6, 2025 8 min read

L2 Deep Dive

AI Engineering

The AI-Native Engineering Stack: Agents, Inference, and Knowledge Graphs in Production (November 2025)

Three November 2025 breakout projects eliminate the manual infrastructure build that blocks teams from running AI agents in production — covering agent backends, Kubernetes LLM inference, and SQL-driven knowledge retrieval.

#ai-engineering #architecture #cloud

Nov 22, 2025 8 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: October 2025 (Part 2)

October's memory and retrieval breakouts: a structured agent memory framework with benchmarks, a self-hosted cognitive memory engine, and sub-10ms semantic search without a vector database cluster.

#ai-engineering #architecture #databases

Nov 20, 2025 6 min read

L2 Deep Dive

System Design

330 Redundant Data Centers All Failed Simultaneously — Because They Were Identical

Cloudflare's November 2023 outage is a case study in correlated failure. Redundancy protects against independent failures. It does nothing when every node runs the same defective code.

#architecture #failures

Nov 8, 2025 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: October 2025 (Part 1)

Three October breakouts targeting LLM prompt verbosity, parallel agent orchestration, and fragmented hybrid search stacks — all reducing coordination overhead in AI engineering.

#ai-engineering #architecture #databases

Oct 25, 2025 11 min read

L3 Reference Guide

Databases

Torn Page Protection Belongs Off the Foreground Path

A PostgreSQL kernel experiment shows why moving torn-page protection from WAL to background flush can change write latency.

#databases #ai-engineering #architecture

Oct 21, 2025 4 min read

L1 Field Note

Engineering Fundamentals

Alert Fatigue Engineering: How to Build Fewer, Better, Actionable Alerts

A dashboard is not observability, and an alert without a specific action is just operational debt masquerading as monitoring.

#failures #checklist #architecture

Oct 15, 2025 14 min read

L3 Reference Guide

AI Engineering

GitHub Breakouts: Q3 2025 — The Quarter's Top Productivity Shifts

Six open-source tools from Q3 2025 that closed the infrastructure gaps blocking AI agents in production: persistent memory, intelligent model routing, and natural language database access.

#ai-engineering #architecture #databases #cloud

Oct 14, 2025 7 min read

L2 Deep Dive

AI Engineering

AI Agents in Platform Automation: Useful Assistant or Unreviewed Change Engine

When AI agents accelerate platform operations versus when they generate unreviewed changes — the permission boundary and audit design that separates useful from risky.

#ai-engineering #architecture #cloud

Oct 7, 2025 13 min read

L2 Deep Dive

Databases

PostgreSQL 18 Replication Upgrade Opportunities

What changes in replication when upgrading from PostgreSQL 14–16 to PostgreSQL 18: parallel apply, pg_createsubscriber, and surfaced conflict visibility.

#databases #architecture #checklist

Sep 6, 2025 7 min read

L2 Deep Dive

Databases

Top GitHub Breakouts: August 2025 — Part I

The gap between AI prototype and production system is routing tables, deployment YAML, and observability scaffolding. August 2025's top breakouts targeted exactly the code engineers keep rewriting: model routing logic, agent deployment manifests, and PostgreSQL diagnostics.

#ai-engineering #architecture #databases

Aug 19, 2025 5 min read

L2 Deep Dive

AI Engineering

FinOps Observability: Tie Cloud Cost to Workload, Team, Product, and Customer

How to connect engineering telemetry with cost telemetry to achieve granular cloud unit economics using FinOps principles and FOCUS standards.

#cloud #architecture #ai-engineering

Jul 26, 2025 19 min read

L3 Reference Guide

Databases

Natural Language SQL Agents Need Database Guardrails

The risk in a natural-language SQL agent is not bad SQL — it is authority compilation: a user sentence becomes a database operation unless the control plane proves, before execution, which role, rows, cost, and columns the query is allowed to touch.

#ai-engineering #databases #architecture

Jul 15, 2025 14 min read

L3 Reference Guide

AI Engineering

GitHub Breakouts: Q2 2025 — The Quarter's Top Productivity Shifts

Six Q2 2025 open-source breakouts that closed the gap between AI agents and engineering infrastructure across system design, platform operations, and database tooling.

#ai-engineering #architecture #databases

Jul 12, 2025 8 min read

L2 Deep Dive

Databases

Covering Indexes Are Not Enough Without Visibility

PostgreSQL index-only scans only stay fast when covering indexes and visibility map maintenance work together.

#databases #architecture #failures

Jul 3, 2025 8 min read

L2 Deep Dive

AI Engineering

Personal AI Agents Fail in the Last 20 Percent of Integration

Self-hosted AI agents become useful only when model quality, tool access, memory, and setup completeness line up.

#ai-engineering #architecture #failures

Jun 25, 2025 9 min read

L2 Deep Dive

AI Engineering

Parallel AI Agents Need an Operating Model

Running many coding agents only works when git isolation, shared memory, permissions, hooks, and verification are designed as a system.

#ai-engineering #architecture #checklist

Jun 22, 2025 8 min read

L2 Deep Dive

Databases

Top GitHub Breakouts: May 2025 — Operational Baseline in a Config File

Three May 2025 open-source projects replace multi-tool assembly in document ingestion, deployment governance, and PostgreSQL backup with single-binary or configuration-first alternatives.

#databases #ai-engineering #architecture

Jun 21, 2025 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: May 2025 — Agent Infrastructure Without Boilerplate

Three May 2025 open-source projects eliminate the manual scaffolding that blocks every AI agent deployment: orchestration glue, vector database setup, and MCP gateway configuration.

#ai-engineering #architecture

Jun 17, 2025 6 min read

L2 Deep Dive

System Design

The End of Single-Signal Alerting: Correlating Metrics, Logs, Traces, Deployments, and Cost

Why paging an engineer solely because CPU hit 85% is an anti-pattern, and how to build correlated alerts that require real operational evidence.

#architecture #failures #system-design

Jun 14, 2025 9 min read

L2 Deep Dive

Databases

Three Open-Source Tools Filling the Gaps in Database Operations (May 2025)

May 2025's most-starred new projects solve three specific database team problems: backup restores that are never verified, internal knowledge that can't be retrieved, and AI agents blind to your schema history.

#databases #ai-engineering #architecture

May 17, 2025 8 min read

L2 Deep Dive

AI Engineering

The Three-Layer Agent Infrastructure Stack for Database Operations (April 2025)

Building a database operations agent requires a workflow framework, production observability, and scalable inference — April 2025 shipped open-source solutions for all three layers simultaneously.

#ai-engineering #architecture #cloud

May 12, 2025 7 min read

L3 Reference Guide

Databases

MongoDB Queryable Encryption Architecture Review

A pre-go-live architecture review for MongoDB Queryable Encryption — key management, field classification, query type constraints, driver requirements, and key rotation.

#databases #architecture #checklist

May 3, 2025 6 min read

L2 Deep Dive

AI Engineering

The Architecture of Natural Language Database Interfaces

Replacing the translation overhead between business questions and SQL queries requires an architecture that bridges LLM intent parsing with strict execution validation and schema retrieval.

#databases #ai-engineering #architecture

Apr 26, 2025 8 min read

L2 Deep Dive

Databases

Per-Application Postgres on Kubernetes Is an Isolation Strategy

How CloudNativePG, GitOps, and External Secrets turn Postgres-on-Kubernetes into an operational isolation pattern.

#databases #cloud #architecture

Apr 15, 2025 5 min read

L2 Deep Dive

AI Engineering

Datadog Bits AI SRE: What an AI On-Call Teammate Changes for DBAs

How autonomous AI agents like Bits AI SRE are shifting the database incident workflow from manual dashboard hunting to conversational investigation.

#ai-engineering #cloud #architecture

Apr 15, 2025 14 min read

L3 Reference Guide

Databases

GitHub Breakouts: Q1 2025 — The Quarter's Top Productivity Shifts

Six high-traction open-source projects from Q1 2025 converged on eliminating the manual integration layer between AI assistants and production systems across databases, platform operations, and developer tooling.

#ai-engineering #architecture #databases #cloud

Apr 8, 2025 7 min read

L2 Deep Dive

Databases

Python Automation Framework for DB and Cloud Ops: Architecture and Failure Model

DB and cloud automation fails when partial failures leave the database, cloud account, and ticketing system describing different operation states.

#architecture #cloud #databases

Mar 8, 2025 7 min read

L2 Deep Dive

AI Engineering

Top GitHub Breakouts: February 2025

The highest-starred new open-source projects in February 2025 eliminating manual iteration in prompt engineering, infrastructure monitoring, and private data retrieval.

#ai-engineering #architecture

Mar 1, 2025 6 min read

L2 Deep Dive

AI Engineering

Evaluate AI Agents by Completed Work, Not Token Price

Production AI agent selection should measure quality, retries, tokens, latency, and verification cost per completed task.

#ai-engineering #checklist #architecture

Mar 1, 2025 9 min read

L2 Deep Dive

Databases

Natural Language SQL Agents Need Guardrails Before Orchestration

How Postgres chat agents turn intent into SQL, and why production systems need schema controls, validation, and auditability.

#databases #ai-engineering #architecture

Jan 28, 2025 23 min read

L3 Reference Guide

AI Engineering

GitHub Year in Review: 2024 — What Open Source Changed in the Engineering Stack

Nine breakout repositories across three themes — agents that operated computers, RAG that grew a graph spine, and databases that finally spoke natively to LLMs — define what actually shifted in the engineering stack in 2024.

#ai-engineering #architecture #databases #cloud

Dec 12, 2024 10 min read

L3 Reference Guide

AI Engineering

Prompt Architecture Needs Load Boundaries

The default AI coding setup loads everything into one always-on instruction file. The production alternative is a layered architecture — project memory, task skills, commands, and MCP servers each with a defined load boundary — so context bloat and stale policy stop reaching the model on every turn.

#ai-engineering #architecture #checklist

Dec 11, 2024 7 min read

L2 Deep Dive

Databases

The 2027 Cloud Database Architecture Roadmap

A 2027 cloud database architecture roadmap for teams that can no longer satisfy consistency, latency, residency, and recovery SLOs with a single engine.

#architecture #databases #cloud

Nov 26, 2024 6 min read

L2 Deep Dive

System Design