Gang Tim Liu created HIVE-3890:
----------------------------------

             Summary: Hive List Bucketing - merge per skewed dir
                 Key: HIVE-3890
                 URL: https://issues.apache.org/jira/browse/HIVE-3890
             Project: Hive
          Issue Type: Improvement
            Reporter: Gang Tim Liu
            Assignee: Gang Tim Liu

Currently, when a list bucketing DML statement involves a merge, Hive uses a single MR job for all skewed directories. If the number of files is large, this can trigger an OOM on the Hive client side because of too many splits. Using one MR job per skewed directory would reduce the OOM risk.
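
To make the proposal concrete, here is a minimal sketch of the planning difference, not Hive's actual implementation. It assumes each skewed directory is simply the parent directory of its files; the function names (`group_files_by_skewed_dir`, `plan_merge_jobs`) and the warehouse paths are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
import posixpath


def group_files_by_skewed_dir(file_paths):
    """Map each parent directory (assumed to be a skewed dir) to its files."""
    groups = defaultdict(list)
    for path in file_paths:
        groups[posixpath.dirname(path)].append(path)
    return dict(groups)


def plan_merge_jobs(file_paths, per_dir=True):
    """Return a list of merge jobs, each represented by its list of input files.

    per_dir=False models the current behaviour: one job over every file.
    per_dir=True models the proposal: one job per skewed directory.
    """
    if not per_dir:
        return [sorted(file_paths)]
    groups = group_files_by_skewed_dir(file_paths)
    return [sorted(groups[d]) for d in sorted(groups)]


# Hypothetical layout: two skewed-value directories plus a default directory.
files = [
    "/warehouse/t/ds=1/x=484/000000_0",
    "/warehouse/t/ds=1/x=484/000001_0",
    "/warehouse/t/ds=1/x=51/000000_0",
    "/warehouse/t/ds=1/default_dir/000000_0",
]

single = plan_merge_jobs(files, per_dir=False)
per_dir = plan_merge_jobs(files, per_dir=True)

# Largest number of input files any one job has to enumerate at once.
print(max(len(job) for job in single))
print(max(len(job) for job in per_dir))
```

In this toy layout the single job has to enumerate all four files at once, while no per-directory job sees more than two. The total amount of work is unchanged; only the peak number of inputs (and hence splits) handled per job submission shrinks, which is the effect the issue relies on to lower client-side memory pressure.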