Architecture Advice: Best practices for consuming Kafka streams into Impala at 500k QPS

汲广熙 Sun, 11 Jan 2026 22:29:51 -0800

Hi Impala Devs,

I am planning a production pipeline to ingest data from Kafka into 
Impala&nbsp;with a high throughput of approximately 500,000 QPS. Given the 
metadata overhead and file management constraints in Impala, I would like to 
get your recommendations on the most robust architecture.


My Current Environment:


Impala Version:&nbsp;4.5.0



Storage:&nbsp;Tencent Cloud COS (Object Storage)



Table Format:&nbsp;Apache Iceberg

Architecture Advice: Best practices for consuming Kafka streams into Impala at 500k QPS

Reply via email to