KIP-1008 — ParKa - the Marriage of Parquet and Kafka
Discussion, Protocol, Broker
Proposes `ParKa`, a format that serializes Kafka record batches in the Apache Parquet columnar format instead of row-oriented Avro, JSON, or Protobuf. Because a columnar layout stores the values of each field contiguously, large batches typically compress better than they do when encoded row by row, and consumers reading from tiered storage can use predicate pushdown to skip data that cannot match a filter.
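The two benefits named above can be illustrated with a small, self-contained sketch. It is not the KIP's wire format and does not use the Parquet library. The record fields (`ts`, `region`, `amount`), the helper names, and the row-group size are all hypothetical and chosen only for illustration. The sketch pivots rows into columns, run-length encodes a low-cardinality column (one of the encodings Parquet applies), and uses per-group min/max statistics to skip groups that cannot satisfy a filter.

```python
from itertools import groupby

# Hypothetical records standing in for the values of one Kafka record batch.
records = [
    {"ts": 1000 + i, "region": "eu" if i < 6 else "us", "amount": i * 10}
    for i in range(12)
]

def to_columns(rows):
    """Pivot a list of row dicts into one list of values per field."""
    return {name: [row[name] for row in rows] for name in rows[0]}

def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]

def make_row_groups(rows, size):
    """Split rows into fixed-size groups, each with per-column min/max stats."""
    groups = []
    for start in range(0, len(rows), size):
        columns = to_columns(rows[start:start + size])
        stats = {name: (min(vals), max(vals)) for name, vals in columns.items()}
        groups.append({"columns": columns, "stats": stats})
    return groups

def scan(groups, column, lower_bound):
    """Return values of `column` >= lower_bound, skipping groups via stats."""
    matches, groups_read = [], 0
    for group in groups:
        _, col_max = group["stats"][column]
        if col_max < lower_bound:
            continue  # no value in this group can match, so it is never read
        groups_read += 1
        matches.extend(v for v in group["columns"][column] if v >= lower_bound)
    return matches, groups_read

columns = to_columns(records)
# Adjacent values of one column repeat, so they collapse into a few runs.
region_runs = run_length_encode(columns["region"])

groups = make_row_groups(records, size=4)
# Predicate "amount >= 90": only groups whose max reaches 90 are read.
matches, groups_read = scan(groups, "amount", lower_bound=90)

print(region_runs)
print(matches, f"read {groups_read} of {len(groups)} groups")
```

In a row-oriented encoding the `region` values are interleaved with the other fields, so the same runs never appear next to each other, and a filter on `amount` has to decode every record to evaluate it.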
Details
| Author | Xinli Shang |
| Status | Discussion |
| Created | 2023-11-21 |
| Last Modified | 2023-12-02 |