GitHub user MayureshGharat opened a pull request:
https://github.com/apache/samza/pull/622
Handle Repartitioning of Streams
Modified the Groupers to handle Repartitioning. This is an opt in feature
and can be enabled by setting the config : "auto.handle.repartition" to true.
This patch assumes that we add a new interface in the groupers that
provides access to the previous mapping of tasks to SSPs.
This assumes that the number of partitions are always increased by a factor
of 2.
This also assumes that the partitioning algorithm used by the producer is
"hash(key) % partitions"
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/MayureshGharat/samza
autoHandlingOfRepartitioning
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/samza/pull/622.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #622
----
commit 1895d29aed1994c95c5da61a9cc4779a4b07bfb8
Author: mgharat <gharatmayuresh15@...>
Date: 2018-09-05T02:04:25Z
Modified the Groupers to handle repartitioning. This is an opt in feature
and can be enabled by setting the config : auto.handle.repartition to true
----
---