Illia Barbashov created HIVE-29574:
--------------------------------------

             Summary: When the skew is found in Tez task at merge join, have a 
threshold for join dup keys
                 Key: HIVE-29574
                 URL: https://issues.apache.org/jira/browse/HIVE-29574
             Project: Hive
          Issue Type: Bug
            Reporter: Illia Barbashov


When Skew is found in single tez task at merge join, 
 * Have a threshold (max rows per join key) check

 * When the join produces more than threshold,

 ** Either kill the task & while throwing the exception, log the join key 
responsible for causing skew explode.

 ** Or at very minimum, log warning about the join key producing huge skew in 
single task.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to