Vector Search Systems

CPU vs GPU vs TPU Explained for Database Engineers

How CPU, GPU, and TPU architectures differ in ways that matter for databases and AI workloads — and which compute class to reach for when adding vector search, embedding generation, or GPU-accelerated analytics.

All Posts

Mar 2, 2024 5 min read

L1 Field Note

CPU vs GPU vs TPU Explained for Database Engineers

Jun 3, 2024 7 min read

L2 Deep Dive

pgvector Basics: Embeddings Inside PostgreSQL

How pgvector adds vector storage and similarity search to PostgreSQL, what the three distance operators do, and the index you must create before you hit 100K rows.

#databases #vector-db #ai-engineering

Mar 3, 2024 5 min read

L1 Field Note

SIMD vs SIMT Explained for Database Engineers

A DBA-friendly explanation of SIMD and SIMT using query execution, vectorized processing, and GPU mental models instead of hardware jargon.

#databases #cpu #gpu #performance

Mar 4, 2024 5 min read

L1 Field Note

Why Databases Are Moving Toward GPU Execution Engines

A practical, DBA-friendly explanation of why modern analytical databases are increasingly using GPUs for scans, joins, aggregations, and AI-adjacent workloads.

Jun 5, 2023 10 min read

L3 Reference Guide

Cloud Database Cost Triage: Storage, IOPS, CPU, Replicas

A structured runbook for identifying which cost dimension is driving your AWS RDS or Aurora bill before making any changes.

#databases #cloud #checklist

Mar 5, 2024 5 min read

L1 Field Note

How a 10 Billion Row SQL Query Runs in 200ms on a GPU Database

A DBA-friendly walkthrough of how modern GPU databases execute large analytical SQL queries using columnar storage, parallel scans, and GPU aggregation.

Mar 6, 2024 4 min read

L1 Field Note

Vector Search on GPU Databases

A DBA-friendly explanation of how vector search works, why GPUs help, and where vector retrieval fits inside modern database and AI systems.

#databases #gpu #vector-search #retrieval

May 16, 2024 6 min read

L2 Deep Dive

Vectorless RAG Patterns for Database Knowledge Systems

How tree-based retrieval can improve DB runbooks, schema docs, and incident knowledge over chunked vector search.

#databases #vector-db #ai-engineering

Jun 14, 2022 4 min read

L1 Field Note

#databases #fundamentals #architecture

B-tree vs LSM Tree: The Storage Engine Tradeoff

Why PostgreSQL and MySQL use B-trees while Cassandra and RocksDB use LSM trees — the read/write tradeoff that determines which storage engine fits your workload.

Jul 20, 2023 7 min read

L2 Deep Dive

System Design

OCI E-Commerce Database Architecture: Autonomous Transaction Processing, GoldenGate, and Object Storage

Isolating the OCI Autonomous Transaction Processing write path from catalog and analytics load using GoldenGate replication and Object Storage offloading.

#architecture #system-design #cloud

Oct 3, 2023 6 min read

L2 Deep Dive

System Design

Shopping Cart Storage: Session Cache, Durable Cart, and Recovery Semantics

Session cache versus durable cart: the recovery semantics that determine data survival across session loss, browser closure, and checkout failure.

#architecture #system-design #cloud

Jul 14, 2024 7 min read

L2 Deep Dive

System Design

Cloud Cost Triage Workflow: Compute, Storage, Data Transfer, Logs, and Managed Services

Cloud cost triage across compute, storage, data transfer, logs, and managed services — a repeatable workflow for finding runaway spend before the bill arrives.

#architecture #system-design #cloud

Aug 30, 2025 12 min read

L3 Reference Guide

The Semantics AI Misses When Porting Storage Designs

Why a PostgreSQL double write buffer prototype failed despite compiling, and what it reveals about AI-assisted systems design.

#databases #ai-engineering #failures

Apr 22, 2026 7 min read

L2 Deep Dive