Wrong results with MAPJOIN when cols from non-MAPJOINed table are selected --------------------------------------------------------------------------
Key: HIVE-1682 URL: https://issues.apache.org/jira/browse/HIVE-1682 Project: Hadoop Hive Issue Type: Bug Affects Versions: 0.7.0 Environment: Hive trunk (rev 1003407) Hadoop 20.2 Reporter: Thiruvel Thirumoolan Results of this query is wrong: set hive.mapjoin.cache.numrows=100; select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar = invites.bar); Results of all the queries below match: /* This is the same as problematic query without specifying numrows - which defaults to 25k much greater than the number of rows in pokes table */ select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar = invites.bar) set hive.mapjoin.cache.numrows=100; select /*+ MAPJOIN(invites) */ invites.bar from pokes join invites on (pokes.bar = invites.bar); select invites.bar from pokes join invites on (pokes.bar = invites.bar); select pokes.bar from pokes join invites on (pokes.bar = invites.bar); -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.