Hi Colin, A quick comment.
On Sat, May 4, 2019 at 11:18 PM Colin McCabe <cmcc...@apache.org> wrote: > The big advantage of doing batching on the controller is that the > controller has more information about what is going on in the cluster. So > it can schedule reassignments in a more optimal way. For instance, it can > schedule reassignments so that the load is distributed evenly across > nodes. This advantage is lost if we have to adhere to a rigid ordering > that is set up in advance. We don't know exactly when anything will > complete in any case. Just because one partition reassignment was started > before another doesn't mean it will finish before another. This is not quite true, right? The Controller doesn't know about partition sizes, throughput per partition and other such information that external tools like Cruise Control track. Ismael