[ https://issues.apache.org/jira/browse/MAPREDUCE-3768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3768:
-----------------------------------------------
    Priority: Major  (was: Blocker)

Also, given MAPREDUCE-3809 is in, we believe this is fixed. Hence reducing the priority, please reprioritize if it isn't fixed yet. Thanks.

> MR-2450 introduced a significant performance regression (Hive)
> --------------------------------------------------------------
>
>                 Key: MAPREDUCE-3768
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3768
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.1
>            Reporter: Eli Collins
>         Attachments: stopcommunicatorpatch.txt
>
>
> MAPREDUCE-2450 introduced, or at least triggers, a significant performance
> regression in Hive. With MR-2450 the execution time of TestCliDriver.skewjoin
> goes from 2 minutes to 15 minutes. Reverting this change from the build fixes
> the issue.
> Here's the relevant query:
> {noformat}
> FROM src src1 JOIN src src2 ON (src1.key = src2.key)
> INSERT OVERWRITE TABLE dest_j1 SELECT src1.key, src2.value;
> {noformat}
> You can reproduce this by running the following from Hive 8.0 against Hadoop
> built from branch-23.
> {noformat}
> ant very-clean package test -Dtestcase=TestCliDriver -Dqfile=skewjoin.q
> {noformat}