Wrong results with MAPJOIN when cols from non-MAPJOINed table are selected
--------------------------------------------------------------------------

                 Key: HIVE-1682
                 URL: https://issues.apache.org/jira/browse/HIVE-1682
             Project: Hadoop Hive
          Issue Type: Bug
    Affects Versions: 0.7.0
         Environment: Hive trunk (rev 1003407)
Hadoop 20.2
            Reporter: Thiruvel Thirumoolan


Results of this query is wrong:

set hive.mapjoin.cache.numrows=100;
select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar 
= invites.bar);

Results of all the queries below match:

/* This is the same as problematic query without specifying numrows - which 
defaults to 25k much greater than the number of rows in pokes table */
select /*+ MAPJOIN(invites) */ pokes.bar from pokes join invites on (pokes.bar 
= invites.bar)

set hive.mapjoin.cache.numrows=100;
select /*+ MAPJOIN(invites) */ invites.bar from pokes join invites on 
(pokes.bar = invites.bar);

select invites.bar from pokes join invites on (pokes.bar = invites.bar);

select pokes.bar from pokes join invites on (pokes.bar = invites.bar);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to