[jira] [Commented] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

Jungtaek Lim (Jira) Wed, 28 Aug 2024 06:46:55 -0700


    [ 
https://issues.apache.org/jira/browse/SPARK-49442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877397#comment-17877397
 ]


Jungtaek Lim commented on SPARK-49442:
--------------------------------------

OK, that's unrelated. We haven't got any report for this kind of complaint. I 
recommend you to provide a minimal reproducer e.g. Apache Spark cluster & 
Apache Kafka cluster (no vendor version and no cloud service version), topic 
partition to 3-5 and increase topic-partition and prove that the metadata 
requests increase linearly, with the detailed explanation about how you capture 
the requests. If you are relying on any vendor rather than building the cluster 
on your own, it'd be ideal to contact with the support.

> Complete Metadata requests on each micro batch causing Kafka issues
> -------------------------------------------------------------------
>
>                 Key: SPARK-49442
>                 URL: https://issues.apache.org/jira/browse/SPARK-49442
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 3.3.2
>            Reporter: vipin Kumar
>            Priority: Major
>              Labels: Kafka, spark-streaming-kafka
>
> We have noticed that spark does complete metadata requests on each micro 
> batch and this is causing high metadata requests on small micro batch 
> intervals .
>  
> For example Kafka with 1900 partitions and 10 sec micro batch we are seeing 
> order of 
> ~{*}360K{*} metadata requests / sec 
> Same with job with 60 sec micro batch we are observing *~60K* meta data 
> requests.
>  
> Metadata requests are controlled by *metadata.max.age.ms* but these config 
> have no effect on spark consumers by default its 5 mins still we are seeing 
> these huge requests.
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

Reply via email to