[ 
https://issues.apache.org/jira/browse/KAFKA-19426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

fujian updated KAFKA-19426:
---------------------------
    Description: 
 
currently. the default value don't make sure the retry can always happen with 
successful

For example: There are possible two timeout (2 * 60s) happen at 
TopicBasedRemoteLogMetadataManager#initializeResources
[2025-06-03 21:57:21,151] INFO Topic __remote_log_metadata does not exist. 
Error: Timed out waiting for a node assignment. Call: listNodes at
[2025-06-03 21:58:21,153] ERROR Encountered error while creating 
__remote_log_metadata topic. java.util.concurrent.ExecutionException: 
org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node 
assignment. Call: createTopics at

 

this original default configure value request the server must be ready within 2 
minutes. Not every broker can reach this requirement.

So propose the [KIP-1197| 
https://cwiki.apache.org/confluence/display/KAFKA/KIP-1197%3A+Introduce+new+method+to+improve+the+TopicBasedRemoteLogMetadataManager%27s+initialization]

 

  was:
 
currently. the default value don't make sure the retry can always happen.

For example: There are possible two timeout (2 * 60s) happen at 
TopicBasedRemoteLogMetadataManager#initializeResources
[2025-06-03 21:57:21,151] INFO Topic __remote_log_metadata does not exist. 
Error: Timed out waiting for a node assignment. Call: listNodes at
[2025-06-03 21:58:21,153] ERROR Encountered error while creating 
__remote_log_metadata topic. java.util.concurrent.ExecutionException: 
org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node 
assignment. Call: createTopics at

And the default 2 minutes is very small so that it won't happen any retry in 
this case.

At the sometimes. this original default configure value request the server must 
be ready within 2 minutes. Not every broker can reach this requirement.

So submit the change to increase the default value to 5 minutes.

the PR:  [Correct the TopicBasedRemoteLogMetadataManager#initializeResources' 
retry default configure value to reasonable one by jiafu1115 · Pull Request 
#20008 · apache/kafka|https://github.com/apache/kafka/pull/20008]


> TopicBasedRemoteLogMetadataManager's initial should happen after the broker 
> ready
> ---------------------------------------------------------------------------------
>
>                 Key: KAFKA-19426
>                 URL: https://issues.apache.org/jira/browse/KAFKA-19426
>             Project: Kafka
>          Issue Type: Bug
>          Components: Tiered-Storage
>    Affects Versions: 3.8.1, 3.9.0, 4.0.0
>            Reporter: fujian
>            Priority: Major
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
>  
> currently. the default value don't make sure the retry can always happen with 
> successful
> For example: There are possible two timeout (2 * 60s) happen at 
> TopicBasedRemoteLogMetadataManager#initializeResources
> [2025-06-03 21:57:21,151] INFO Topic __remote_log_metadata does not exist. 
> Error: Timed out waiting for a node assignment. Call: listNodes at
> [2025-06-03 21:58:21,153] ERROR Encountered error while creating 
> __remote_log_metadata topic. java.util.concurrent.ExecutionException: 
> org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node 
> assignment. Call: createTopics at
>  
> this original default configure value request the server must be ready within 
> 2 minutes. Not every broker can reach this requirement.
> So propose the [KIP-1197| 
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-1197%3A+Introduce+new+method+to+improve+the+TopicBasedRemoteLogMetadataManager%27s+initialization]
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to