[
https://issues.apache.org/jira/browse/KAFKA-17190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Divij Vaidya updated KAFKA-17190:
---------------------------------
Fix Version/s: 3.8.2
> AssignmentsManager gets stuck retrying on deleted topics
> --------------------------------------------------------
>
> Key: KAFKA-17190
> URL: https://issues.apache.org/jira/browse/KAFKA-17190
> Project: Kafka
> Issue Type: Bug
> Reporter: Colin McCabe
> Assignee: Colin McCabe
> Priority: Critical
> Fix For: 3.9.0, 3.7.2, 3.8.2
>
>
> In MetadataVersion 3.7-IV2 and above, on the broker, AssignmentsManager sends
> an RPC to the controller informing it about which directory we have chosen to
> place a replica on. Unfortunately, the code does not check to see if the
> topic still exists in the MetadataImage before sending the RPC. It will also
> retry infinitely. Therefore, when a topic is created and deleted in rapid
> succession, we can get stuck retrying the AssignReplicasToDirsRequest forever.
> In order to prevent this problem, the AssignmentsManager should check if a
> topic still exists (and is still present on the broker in question) before
> sending the RPC. In order to prevent log spam, we should not log any error
> messages until several minutes have gone past without success. Finally,
> rather than creating a new EventQueue event for each assignment request, we
> should simply modify a shared data structure and schedule a deferred event to
> send the accumulated RPCs. This will improve efficiency.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)