Durable State for Long-Running LLM Coding Sessions
A practical workflow for separating planning from execution, checkpointing progress in GitHub issues, and resuming multi-phase LLM implementation without context collapse.
10 posts
A practical workflow for separating planning from execution, checkpointing progress in GitHub issues, and resuming multi-phase LLM implementation without context collapse.
Why teams get more leverage from agent systems than chat interfaces, and the minimum architecture required to make agents reliable.
A practical look at Paperclip's model for running teams of AI agents around business goals instead of individual tasks, and why that changes the operating model of software and services.
A practical control plane for keeping AI coding sessions on track: separate planning from execution, validate deterministically, reset context aggressively, and isolate parallel work.
A deep dive into choosing a modern open database stack with vector capabilities and clear operational tradeoffs.
A DBA-friendly explanation of how vector search works, why GPUs help, and where vector retrieval fits inside modern database and AI systems.
A DBA-friendly walkthrough of how modern GPU databases execute large analytical SQL queries using columnar storage, parallel scans, and GPU aggregation.
A practical, DBA-friendly explanation of why modern analytical databases are increasingly using GPUs for scans, joins, aggregations, and AI-adjacent workloads.
A DBA-friendly explanation of SIMD and SIMT using query execution, vectorized processing, and GPU mental models instead of hardware jargon.
A practical, DBA-friendly guide to understanding CPU, GPU, and TPU architectures using database mental models instead of hardware jargon.