[
https://issues.apache.org/jira/browse/HIVE-29574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Illia Barbashov reassigned HIVE-29574:
--------------------------------------
Assignee: Illia Barbashov
> When the skew is found in Tez task at merge join, have a threshold for join
> dup keys
> ------------------------------------------------------------------------------------
>
> Key: HIVE-29574
> URL: https://issues.apache.org/jira/browse/HIVE-29574
> Project: Hive
> Issue Type: Bug
> Reporter: Illia Barbashov
> Assignee: Illia Barbashov
> Priority: Major
>
> When Skew is found in single tez task at merge join,
> * Have a threshold (max rows per join key) check
> * When the join produces more than threshold,
> ** Either kill the task & while throwing the exception, log the join key
> responsible for causing skew explode.
> ** Or at very minimum, log warning about the join key producing huge skew in
> single task.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)