Aman Sinha created DRILL-2900:
---------------------------------

             Summary: Joins are not getting parallelized in TestTpchDistributed unit tests
                 Key: DRILL-2900
                 URL: https://issues.apache.org/jira/browse/DRILL-2900
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
    Affects Versions: 0.8.0
            Reporter: Aman Sinha
            Assignee: Jinfeng Ni


The unit tests in org.apache.drill.TestTpchDistributed are supposed to 
produce plans with parallel joins, parallel aggregations and parallel sorts.  
To force parallelism, the slice_target value is set to 10 (the default is 
100K).  However, the plans of at least the few queries that I tried (queries 
03, 05 and 14) did not parallelize the hash joins.  The aggregations and sorts 
do have Exchanges, so they are getting parallelized.  
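
One rough way to catch this kind of regression in a test is to scan the text 
plan for hash joins that ended up in the root fragment.  The following is only 
a sketch under assumptions: it assumes each plan line starts with an operator 
id of the form major-operator (for example 00-01), and that operators in major 
fragment 00 are not parallelized.  PlanCheck and its methods are hypothetical 
helpers for illustration, not part of Drill.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not a Drill class.
final class PlanCheck {

    // Assumed line shape: "<major>-<operator>  OperatorName(...)",
    // e.g. "01-03    HashJoin(condition=...)".
    private static final Pattern OP_LINE =
            Pattern.compile("^\\s*(\\d+)-(\\d+)\\s+(\\w+)");

    // Returns the ids of HashJoin operators found in major fragment 00.
    // Assumption: fragment 00 is the single-threaded root fragment.
    static List<String> hashJoinsInRootFragment(String planText) {
        List<String> found = new ArrayList<>();
        for (String line : planText.split("\\r?\\n")) {
            Matcher m = OP_LINE.matcher(line);
            if (m.find()
                    && m.group(3).equals("HashJoin")
                    && m.group(1).equals("00")) {
                found.add(m.group(1) + "-" + m.group(2));
            }
        }
        return found;
    }

    // Counts operators whose name ends in "Exchange"
    // (for example BroadcastExchange).
    static int countExchanges(String planText) {
        int count = 0;
        for (String line : planText.split("\\r?\\n")) {
            Matcher m = OP_LINE.matcher(line);
            if (m.find() && m.group(3).endsWith("Exchange")) {
                count++;
            }
        }
        return count;
    }
}
```

A test could then fail when hashJoinsInRootFragment returns a non-empty list 
for a query that is expected to have a parallel join.  This is a heuristic on 
plan text only; it does not inspect the physical plan objects.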

It is likely that many plans have a BroadcastExchange on the right input of 
the join and, since the left input of the join is typically a single file, we 
may be dropping the exchange on the right input.  This should be fixed. 



