krivaltsevich.com Kafka Internals4.4

Apache Kafka Internals

A deep, source-derived guide to Apache Kafka in three parts: how it works inside, how to operate it at scale, and what the distributed log teaches us as an architectural blueprint.

Apache Kafka 4.4.0-SNAPSHOT · git 04bfe7d · 2026-06-15 Derived from source · not copied from official docs 3 parts · 46 chapters

New here? Start with the Architecture Overview. Operating a cluster? Jump to Part II, Operations. Designing a system? See Part III, The Log as a Blueprint.

Part I · Architecture Internals

How Kafka actually works inside, from the on-disk byte layout of a record batch up to the KRaft controller quorum, the coordinators, and the client runtimes.

Part II · Operations Manual

How to run it: limits, tuning, capacity & partition sizing, failure runbooks, the signals to watch, cost, and what changes at 1M / 10M / 100M events per second.

Part III · The Log as a Blueprint

Kafka as one implementation of the distributed-log pattern, when to choose it, its inherent tradeoffs, the reusable engineering tactics, and the design space.

krivaltsevich.com · Part of Apache Kafka Internals · derived from Apache Kafka 4.4 source · GitHub · MIT-licensed.

Apache Kafka® is a registered trademark of the Apache Software Foundation. This is an independent, unofficial guide, not affiliated with or endorsed by the ASF.