[ https://issues.apache.org/jira/browse/SPARK-3395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael Armbrust resolved SPARK-3395. ------------------------------------- Resolution: Fixed Fix Version/s: 1.2.0 > {SQL] DSL uses incorrect attribute ids after a distinct() > --------------------------------------------------------- > > Key: SPARK-3395 > URL: https://issues.apache.org/jira/browse/SPARK-3395 > Project: Spark > Issue Type: Bug > Components: SQL > Reporter: Eric Liang > Assignee: Eric Liang > Priority: Minor > Fix For: 1.2.0 > > > In the following example, > val rdd = ... // two columns: {key, value} > val derivedRDD = rdd.distinct().limit(1) > sql("explain select * from rdd inner join derivedRDD on rdd.key = > derivedRDD.key") > The inner join executes incorrectly since the two keys end up with the same > attribute id after analysis. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org