[ https://issues.apache.org/jira/browse/SPARK-23195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiao Li resolved SPARK-23195. ----------------------------- Resolution: Fixed Fix Version/s: 2.3.0 > Hint of cached data is lost > --------------------------- > > Key: SPARK-23195 > URL: https://issues.apache.org/jira/browse/SPARK-23195 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 2.2.1, 2.3.0 > Reporter: Xiao Li > Assignee: Xiao Li > Priority: Major > Fix For: 2.3.0 > > > {noformat} > withSQLConf(SQLConf.AUTO_BROADCASTJOIN_THRESHOLD.key -> "-1") { > val df1 = spark.createDataFrame(Seq((1, "4"), (2, "2"))).toDF("key", > "value") > val df2 = spark.createDataFrame(Seq((1, "1"), (2, "2"))).toDF("key", > "value") > broadcast(df2).cache() > df2.collect() > val df3 = df1.join(df2, Seq("key"), "inner") > val numBroadCastHashJoin = df3.queryExecution.executedPlan.collect { > case b: BroadcastHashJoinExec => b > }.size > assert(numBroadCastHashJoin === 1) > } > {noformat} > The broadcast hint is not respected. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org