Skip to content

May 2026

AI Inference Needs a Global Resilience Layer

Background: AI Inference

First, some background. What is AI inference?

When you ask ChatGPT "Can you explain [X] to me?", what happens? A server somewhere converts that question to tokens and passes them through some trained model. The model performs a lot of calculations, generating a response, token by token, until it has a complete answer, which it then somehow delivers to you.

Note: we're not talking about training a model here; rather, about using an already-trained model.

K8gb local setup beginners guide

Blog Banner

A Beginner-Friendly Guide to k8gb Local Setup

When I first started working with k8gb, the local setup took me a while to fully wrap my head around. And I'm a contributor to this project! So if you're new to k8gb and the local setup feels like a lot to take in, trust me — that's completely normal.

The local setup simulates a real multi-cluster, multi-region environment on your laptop. That's genuinely powerful, but it also means there's more moving parts than a typical "run one cluster" tutorial. Once the mental model clicks, everything falls into place.

So I decided to write the walkthrough I wish I had when I was getting started.