marymwu created HIVE-14160:
------------------------------

             Summary: Reduce-task costs a long time to finish on the condition 
that the certain sql "select a,distinct(b) group by a" has been executed on the 
data which has skew distribution
                 Key: HIVE-14160
                 URL: https://issues.apache.org/jira/browse/HIVE-14160
             Project: Hive
          Issue Type: Improvement
          Components: hpl/sql
    Affects Versions: 1.1.0
            Reporter: marymwu


Reduce-task costs a long time to finish on the condition that the certain sql 
"select a,distinct(b) group by a" has been executed on the data which has skew 
distribution

data scale: 64G



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to