Home

Papers

Research papers, tutorials, and technical explorations.

Maybe You Don't Need Kubernetes

Kubernetes was built on decade-old assumptions, and its networking model still reflects that history. Modern identity-first networking approaches expose how much operational complexity we've normalized — and offer a path to simpler systems with lower human cost.

Feb 8, 2026

Stop Re-Explaining Your Codebase to Your AI Agent

AI coding agents can write code, but each session starts without knowledge of your repo structure, conventions, or prior work. Scaffold is an open-source tool that gives agents persistent, machine-readable project context through structured briefing files, with explicit controls to detect and limit drift.

Feb 4, 2026

Building Memory as a Service with memrun

Memrun is a Python SDK and platform for deploying data-intensive services that need warm memory, sticky routing, and bounded concurrency. Deploy a handler with a decorator. The platform handles NVMe caching, locality-aware routing via Kafka, and worker lifecycle on commodity hardware.

Jan 22, 2026

Serverless Warm Memory: A Distinct Service Category

Serverless platforms optimized for stateless, elastic compute. But memory-bound workloads need the opposite: warm RAM, fast local disk, and sticky routing. This isn't better serverless. It's a different abstraction entirely. We describe the programming model, the platform mechanics, and how memrun implements it.

Jan 19, 2026

Stateful by Design: The Missing Serverless Abstraction

Serverless removed servers from the developer's mental model, but it also removed state from first-class support. For data-intensive workloads, that tradeoff is increasingly costly. We built memrun to prove that warm, locality-aware workers can remain serverless in feel while eliminating the state tax that stateless platforms impose.

Dec 26, 2025

Local-First Architecture: Faster Dev Loops, Lower Cloud Bills

Running faithful cloud infrastructure replicas on developer machines can significantly reduce inner-loop latency and cut non-production cloud spend. This article examines what local-first means in an infrastructure context, what the data supports, where the approach breaks down, and how to adopt it without cargo-culting.

Aug 31, 2024

© 2026

About RSS Tags