[
https://issues.apache.org/jira/browse/KAFKA-5578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bruno Cadonna resolved KAFKA-5578.
----------------------------------
Resolution: Duplicate
> Streams Task Assignor should consider the staleness of state directories when
> allocating tasks
> ----------------------------------------------------------------------------------------------
>
> Key: KAFKA-5578
> URL: https://issues.apache.org/jira/browse/KAFKA-5578
> Project: Kafka
> Issue Type: Bug
> Components: streams
> Reporter: Damian Guy
> Priority: Major
>
> During task assignment we use the presence of a state directory to assign
> precedence to which instances should be assigned the task. We first chose
> previous active tasks, but then fall back to the existence of a state dir.
> Unfortunately we don't take into account the recency of the data from the
> available state dirs. So in the case where a task has run on many instances,
> it may be that we chose an instance that has relatively old data.
> When doing task assignment we should take into consideration the age of the
> data in the state dirs. We could use the data from the checkpoint files to
> determine which instance is most up-to-date and attempt to assign accordingly
> (obviously making sure that tasks are still balanced across available
> instances)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)