Vlad Rozov created DRILL-6030:
---------------------------------

             Summary: Managed sort should minimize number of batches in a k-way merge
                 Key: DRILL-6030
                 URL: https://issues.apache.org/jira/browse/DRILL-6030
             Project: Apache Drill
          Issue Type: Improvement
            Reporter: Vlad Rozov
            Assignee: Vlad Rozov


The time complexity of the k-way merge is O(n*k*log(k)), where k is the 
number of batches to merge and n is the number of records in each batch 
(assuming equal-size batches). Since n*k is the total number of records to 
merge and can be quite large, minimizing k should reduce the log(k) cost paid 
for each record and give better results.
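
To illustrate the estimate above, here is a minimal Python sketch of a 
heap-based k-way merge that counts key comparisons. It is not Drill's managed 
sort implementation; the names Entry, k_way_merge and comparisons_for are 
hypothetical and exist only for this illustration. The total number of records 
is held fixed while k varies, so any change in the comparison count comes from 
the log(k) factor.

```python
import heapq


class Entry:
    """Head record of one batch; counts every comparison the heap performs."""

    def __init__(self, key, batch, pos, counter):
        self.key = key          # sort key of the record
        self.batch = batch      # index of the batch the record came from
        self.pos = pos          # position of the record inside that batch
        self.counter = counter  # shared one-element list used as a counter

    def __lt__(self, other):
        self.counter[0] += 1
        return self.key < other.key


def k_way_merge(batches):
    """Merge already-sorted batches; return (merged records, comparisons)."""
    counter = [0]
    # The heap holds at most one entry per batch, so its size is at most k.
    heap = [Entry(b[0], i, 0, counter) for i, b in enumerate(batches) if b]
    heapq.heapify(heap)
    merged = []
    while heap:
        top = heap[0]
        merged.append(top.key)
        batch = batches[top.batch]
        nxt = top.pos + 1
        if nxt < len(batch):
            # Replace the emitted record with the next one from the same batch.
            heapq.heapreplace(heap, Entry(batch[nxt], top.batch, nxt, counter))
        else:
            # The batch is exhausted; shrink the heap.
            heapq.heappop(heap)
    return merged, counter[0]


def comparisons_for(total, k):
    """Split `total` records into k equal-size sorted batches and merge them."""
    batches = [list(range(i, total, k)) for i in range(k)]
    merged, comparisons = k_way_merge(batches)
    assert merged == list(range(total))
    return comparisons


if __name__ == "__main__":
    # Same total number of records, different number of batches.
    for k in (2, 4, 16, 64, 256):
        print(k, comparisons_for(4096, k))
```

Running the script prints the comparison count for each k at a fixed total of 
4096 records. The count should grow as k grows, which is the effect the 
improvement aims to reduce.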



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
