[ 
https://issues.apache.org/jira/browse/HIVE-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900941#action_12900941
 ] 

Ning Zhang commented on HIVE-741:
---------------------------------

The SMB test case still has a minor issue: the tables was created as 2 buckets 
but there is only 1 file in the tables. This is conflicting to the table 
schema. If a table is defined as bucketd 2, there should be 2 files in the 
partition or table. They SMB join takes the 1st file in T1 join the 1st file in 
T2, and 2nd file in T1 join 2nd file in T2. So the test case should cover this 
use case. 

> NULL is not handled correctly in join
> -------------------------------------
>
>                 Key: HIVE-741
>                 URL: https://issues.apache.org/jira/browse/HIVE-741
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Ning Zhang
>            Assignee: Amareshwari Sriramadasu
>         Attachments: patch-741-1.txt, patch-741-2.txt, patch-741-3.txt, 
> patch-741.txt, smbjoin_nulls.q.txt
>
>
> With the following data in table input4_cb:
> Key        Value
> ------       --------
> NULL     325
> 18          NULL
> The following query:
> {code}
> select * from input4_cb a join input4_cb b on a.key = b.value;
> {code}
> returns the following result:
> NULL    325    18   NULL
> The correct result should be empty set.
> When 'null' is replaced by '' it works.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to