
KIP-1008 — ParKa - the Marriage of Parquet and Kafka

Discussion · Protocol · Broker

Proposes a `ParKa` format that serializes Kafka record batches in the Apache Parquet columnar format instead of row-oriented encodings such as Avro, JSON, or Protobuf. Because a columnar layout stores all values of a field together, large batches compress much better, and consumers reading from tiered storage can use predicate pushdown to skip data that cannot match a filter.
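To make the two claims concrete, here is a minimal standard-library sketch. It does not use Parquet and is not the format the KIP specifies: the record fields (`user_id`, `country`, `amount`) and the helper names are hypothetical, JSON plus `zlib` stands in for a real encoder, and the min/max check only imitates the idea behind Parquet's column statistics.

```python
import json
import zlib

# Hypothetical record batch; the field names are invented for illustration.
records = [
    {"user_id": 1000 + i % 50, "country": "US" if i % 3 else "DE", "amount": i * 10}
    for i in range(1000)
]


def to_rows(batch):
    """Row-oriented layout: one JSON object per record."""
    return "\n".join(json.dumps(r) for r in batch).encode()


def to_columns(batch):
    """Column-oriented layout: one array per field, so similar values sit together."""
    columns = {name: [r[name] for r in batch] for name in batch[0]}
    return json.dumps(columns).encode()


def column_stats(batch, field):
    """Per-batch (min, max) for one field, similar in spirit to column statistics."""
    values = [r[field] for r in batch]
    return min(values), max(values)


def may_match(stats, lower_bound):
    """Pushdown check for the predicate `field >= lower_bound`.

    Returns False when the batch maximum is below the bound, which means the
    batch can be skipped without decoding any of its records.
    """
    _, highest = stats
    return highest >= lower_bound


# Compare compressed sizes of the two layouts for the same data.
row_size = len(zlib.compress(to_rows(records)))
col_size = len(zlib.compress(to_columns(records)))
print(f"row layout: {row_size} bytes, column layout: {col_size} bytes")

# Decide from statistics alone whether the batch needs to be read.
stats = column_stats(records, "amount")
print(may_match(stats, 5_000))   # True: some amounts reach 5,000
print(may_match(stats, 50_000))  # False: the largest amount is 9,990
```

The size comparison depends on the data and the compressor, so treat the printed numbers as an illustration rather than a benchmark. The skip decision is the more important part: it is made from a small summary of the batch, which is what lets a reader avoid fetching and decoding data from tiered storage.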

Details

Author: Xinli Shang
Status: Discussion
Wiki: View on Apache Wiki
Created: 2023-11-21
Last Modified: 2023-12-02