Aditya Kishore created DRILL-855:
------------------------------------

             Summary: Improve work assignment parallelization
                 Key: DRILL-855
                 URL: https://issues.apache.org/jira/browse/DRILL-855
             Project: Apache Drill
          Issue Type: Improvement
            Reporter: Aditya Kishore
            Assignee: Aditya Kishore


Currently, if only a small number of endpoints have data affinity, all the work 
units get assigned to this small subset, leaving the other endpoints in the 
cluster underutilized.

This leads to poor query performance, since the level of parallelization is 
reduced below what is otherwise possible.

This JIRA intends to modify the assignment to distribute work fragments more 
equitably among all available nodes while slightly favoring the nodes with 
data affinity.

This \*favor\* is controlled by the setting 
{{"drill.exec.work.affinity.factor"}} (defaults to 1.2f).

As an example, let's say that there are 2 endpoints, only one of which has 
affinity, and 10 work fragments. In the current scheme of things, all of these 
work fragments will get assigned to the single node with affinity. With the 
modified scheme and the default affinity factor of 1.2, that node will get 20% 
more work than it would get under an equal distribution (i.e. 6:4 instead of 5:5).
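
The arithmetic in this example can be sketched as below. This is only an 
illustration of the 6:4 split and not the actual patch: the class name, the 
method {{assign}}, its parameters, and the rounding and remainder handling are 
all assumptions made for the sketch.

```java
import java.util.Arrays;

public class AffinityAssignmentSketch {

    /**
     * Hypothetical helper (not Drill code): returns the number of work units
     * per endpoint. Endpoints 0..affinityEndpoints-1 are assumed to be the
     * ones with data affinity.
     */
    static int[] assign(int workUnits, int endpoints, int affinityEndpoints,
                        double affinityFactor) {
        if (endpoints <= 0 || affinityEndpoints < 0 || affinityEndpoints > endpoints) {
            throw new IllegalArgumentException("invalid endpoint counts");
        }
        int[] counts = new int[endpoints];
        double equalShare = (double) workUnits / endpoints;
        // Assumption: if every endpoint has affinity there is nobody to favor.
        double factor = (affinityEndpoints == endpoints) ? 1.0 : affinityFactor;
        int remaining = workUnits;

        // Affinity endpoints get the equal share scaled by the factor.
        for (int i = 0; i < affinityEndpoints; i++) {
            int share = (int) Math.min(remaining, Math.round(equalShare * factor));
            counts[i] = share;
            remaining -= share;
        }
        // The rest is spread as evenly as possible over the other endpoints.
        for (int i = affinityEndpoints; i < endpoints; i++) {
            int slotsLeft = endpoints - i;
            int share = (remaining + slotsLeft - 1) / slotsLeft; // ceiling division
            counts[i] = share;
            remaining -= share;
        }
        // Any rounding leftover (only possible when all endpoints have affinity).
        counts[0] += remaining;
        return counts;
    }

    public static void main(String[] args) {
        // 10 work fragments, 2 endpoints, 1 with affinity, factor 1.2
        System.out.println(Arrays.toString(assign(10, 2, 1, 1.2))); // prints [6, 4]
    }
}
```

With an affinity factor of 1.0 the same sketch falls back to the equal 5:5 
distribution.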



--
This message was sent by Atlassian JIRA
(v6.2#6252)
