[ https://issues.apache.org/jira/browse/KAFKA-5578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Bruno Cadonna resolved KAFKA-5578. ---------------------------------- Resolution: Duplicate > Streams Task Assignor should consider the staleness of state directories when > allocating tasks > ---------------------------------------------------------------------------------------------- > > Key: KAFKA-5578 > URL: https://issues.apache.org/jira/browse/KAFKA-5578 > Project: Kafka > Issue Type: Bug > Components: streams > Reporter: Damian Guy > Priority: Major > > During task assignment we use the presence of a state directory to assign > precedence to which instances should be assigned the task. We first chose > previous active tasks, but then fall back to the existence of a state dir. > Unfortunately we don't take into account the recency of the data from the > available state dirs. So in the case where a task has run on many instances, > it may be that we chose an instance that has relatively old data. > When doing task assignment we should take into consideration the age of the > data in the state dirs. We could use the data from the checkpoint files to > determine which instance is most up-to-date and attempt to assign accordingly > (obviously making sure that tasks are still balanced across available > instances) -- This message was sent by Atlassian Jira (v8.3.4#803005)