[ 
https://issues.apache.org/jira/browse/HIVE-29574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-29574 started by Illia Barbashov.
----------------------------------------------
> When the skew is found in Tez task at merge join, have a threshold for join 
> dup keys
> ------------------------------------------------------------------------------------
>
>                 Key: HIVE-29574
>                 URL: https://issues.apache.org/jira/browse/HIVE-29574
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Illia Barbashov
>            Assignee: Illia Barbashov
>            Priority: Major
>
> When Skew is found in single tez task at merge join, 
>  * Have a threshold (max rows per join key) check
>  * When the join produces more than threshold,
>  ** Either kill the task & while throwing the exception, log the join key 
> responsible for causing skew explode.
>  ** Or at very minimum, log warning about the join key producing huge skew in 
> single task.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to