Cassandra Architecture Compaction Fundamentals
Storage-engine internals, LSM-tree mechanics, compaction strategies, token-ring distribution, gossip, repair, and tombstone lifecycle.
Production-grade playbooks for tuning compaction, scheduling repair, and automating node lifecycles in Apache Cassandra 4.x / 5.x.
Compaction Strategy is a focused, no-fluff resource for the teams who keep Cassandra clusters healthy under load. Every guide is anchored in real operational constraints: I/O backpressure, repair cadence, tombstone lifecycles, gossip convergence, and the disk-headroom math that decides whether a strategy change is safe to ship.
The material is organized into two tracks. Architecture & Fundamentals covers
the storage engine itself — LSM-tree mechanics, STCS vs LCS vs TWCS, token-ring distribution,
gossip and failure detection, read-repair vs anti-entropy repair, and tombstone garbage
collection. Tuning & Monitoring turns that theory into operations: backlog
alerting thresholds, async compaction metrics, error categorization, read-path fallback
routing, and Python pipelines that wrap nodetool and JMX safely.
Code samples are idempotent, bounded, and written for automation pipelines rather than one-off shells — exactly what you need to operate multi-datacenter clusters within strict SLA boundaries.
Two tracks, from storage-engine internals to production automation. Pick a section to dive in.
Storage-engine internals, LSM-tree mechanics, compaction strategies, token-ring distribution, gossip, repair, and tombstone lifecycle.
Strategy tuning, backlog alerting, async metrics, error categorization, read-path fallback routing, and Python automation pipelines.