Illia Barbashov created HIVE-29574:
--------------------------------------
Summary: When the skew is found in Tez task at merge join, have a
threshold for join dup keys
Key: HIVE-29574
URL: https://issues.apache.org/jira/browse/HIVE-29574
Project: Hive
Issue Type: Bug
Reporter: Illia Barbashov
When Skew is found in single tez task at merge join,
* Have a threshold (max rows per join key) check
* When the join produces more than threshold,
** Either kill the task & while throwing the exception, log the join key
responsible for causing skew explode.
** Or at very minimum, log warning about the join key producing huge skew in
single task.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)