KaiXinXIaoLei created SPARK-23564:
-------------------------------------

             Summary: The optimized logical plan for a left anti join should be optimized further
                 Key: SPARK-23564
                 URL: https://issues.apache.org/jira/browse/SPARK-23564
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.3.0
            Reporter: KaiXinXIaoLei


The Optimized Logical Plan of the query '*select * from tt1 left anti join tt2 
on tt2.i = tt1.i*' is 

 
{code:java}
== Optimized Logical Plan ==
Join LeftAnti, (i#2 = i#0)
:- HiveTableRelation `default`.`tt1`, org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#0, s#1]
+- Project [i#2]
   +- HiveTableRelation `default`.`tt2`, org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#2, s#3]
{code}
 

 

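This plan can be optimized further by adding a 'Filter isnotnull' on the join key of the right table. Rows of tt2 whose i is null can never satisfy tt2.i = tt1.i, so dropping them before the join does not change the result. The plan would then be as follows: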
{code:java}
== Optimized Logical Plan ==
Join LeftAnti, (i#2 = i#0)
:- HiveTableRelation `default`.`tt1`, org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#0, s#1]
+- Project [i#2]
   +- Filter isnotnull(i#2)
      +- HiveTableRelation `default`.`tt2`, org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#2, s#3]
{code}
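
To illustrate why the extra filter on the right side should be safe, here is a minimal sketch in plain Python (not Spark code). It models a left anti join with SQL-style equality, where a null key never matches anything. The helper names (sql_equals, left_anti_join, filter_is_not_null) and the sample rows are made up for illustration and are not Spark APIs or data from this issue.

```python
# Sketch only: models the semantics of the proposed plan, not Spark's optimizer.
# Rows are (i, s) tuples; None stands in for SQL NULL.

def sql_equals(a, b):
    # SQL equality used as a join condition: NULL = anything is never true.
    return a is not None and b is not None and a == b

def left_anti_join(left, right):
    # Keep each left row that has no matching right row on the key i.
    return [l for l in left if not any(sql_equals(l[0], r[0]) for r in right)]

def filter_is_not_null(rows):
    # Counterpart of 'Filter isnotnull(i)' applied to the right table.
    return [r for r in rows if r[0] is not None]

# Hypothetical sample data for tt1 and tt2.
tt1 = [(1, "a"), (2, "b"), (None, "c")]
tt2 = [(1, "x"), (None, "y"), (3, "z")]

unfiltered = left_anti_join(tt1, tt2)
filtered = left_anti_join(tt1, filter_is_not_null(tt2))

print(unfiltered == filtered)
```

Note that the same filter cannot be applied to the left table: a tt1 row with a null key has no match and must therefore appear in the anti join output.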



