[ https://issues.apache.org/jira/browse/SPARK-20897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16026588#comment-16026588 ]
Michael Armbrust commented on SPARK-20897:
------------------------------------------

Is this a regression? If so, can you please make sure that it's targeted at the 2.2.0 release.

> cached self-join should not fail
> --------------------------------
>
>                 Key: SPARK-20897
>                 URL: https://issues.apache.org/jira/browse/SPARK-20897
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.2.0
>            Reporter: Wenchen Fan
>            Assignee: Wenchen Fan
>
> code to reproduce this bug:
> {code}
> // force to plan sort merge join
> spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "0")
> val df = Seq(1 -> "a").toDF("i", "j")
> val df1 = df.as("t1")
> val df2 = df.as("t2")
> assert(df1.join(df2, $"t1.i" === $"t2.i").cache().count() == 1)
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)