Xiao Li created SPARK-23192:
-------------------------------

             Summary: Hint is lost after using cached data
                 Key: SPARK-23192
                 URL: https://issues.apache.org/jira/browse/SPARK-23192
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.2.1, 2.3.0
            Reporter: Xiao Li
            Assignee: Xiao Li


The hint of the plan segment is lost, if the plan segment is replaced by the 
cached data. 

{noformat}
      val df1 = spark.createDataFrame(Seq((1, "4"), (2, "2"))).toDF("key", 
"value")
      val df2 = spark.createDataFrame(Seq((1, "1"), (2, "2"))).toDF("key", 
"value")
      df2.cache()
      val df3 = df1.join(broadcast(df2), Seq("key"), "inner")
{noformat}

Hint is lost in {{df3}}. The physical join algorithm will not respect the hint 
due to the loss.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to