Wrong results with MAPJOIN when cols from non-MAPJOINed table are selected
--------------------------------------------------------------------------
Key: HIVE-1682
URL: https://issues.apache.org/jira/browse/HIVE-1682
Project: Hadoop Hive
Issue Type: Bug
Affects Versions: 0.7.0
Environment: Hive trunk (rev 1003407)
Hadoop 20.2
Reporter: Thiruvel Thirumoolan
The results of this query are wrong:
set hive.mapjoin.cache.numrows=100;
select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar
= invites.bar);
Results of all the queries below match:
/* This is the same as the problematic query, but without setting numrows, which
defaults to 25k, far more than the number of rows in the pokes table. */
select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar
= invites.bar);
set hive.mapjoin.cache.numrows=100;
select /*+ MAPJOIN(invites) */ invites.bar from pokes join invites on
(pokes.bar = invites.bar);
select invites.bar from pokes join invites on (pokes.bar = invites.bar);
select pokes.bar from pokes join invites on (pokes.bar = invites.bar);