Apache Kafka Internals
A deep, source-derived guide to Apache Kafka in three parts: how it works inside, how to operate it at scale, and what the distributed log teaches us as an architectural blueprint.
New here? Start with the Architecture Overview. Operating a cluster? Jump to Part II, Operations. Designing a system? See Part III, The Log as a Blueprint.
Part I · Architecture Internals
How Kafka actually works inside, from the on-disk byte layout of a record batch up to the KRaft controller quorum, the coordinators, and the client runtimes.
Part II · Operations Manual
How to run it: limits, tuning, capacity & partition sizing, failure runbooks, the signals to watch, cost, and what changes at 1M / 10M / 100M events per second.
Part III · The Log as a Blueprint
Kafka as one implementation of the distributed-log pattern, when to choose it, its inherent tradeoffs, the reusable engineering tactics, and the design space.