The browser is now a production AI runtime. This guide covers every layer, with working code: WebGPU acceleration, WebLLM and Transformers.js inference, Chrome's Gemini Nano APIs, small language models, agent reasoning loops, and the WebMCP standard.