[ https://issues.apache.org/jira/browse/SPARK-17734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15541870#comment-15541870 ]
Dongjoon Hyun commented on SPARK-17734: --------------------------------------- Hi, [~pdxleif]. How do you think about the following? ``` table1.join(table2, table1.col("value") === table2.col("value")) ``` > inner equi-join shorthand that returns Datasets, like DataFrame already has > --------------------------------------------------------------------------- > > Key: SPARK-17734 > URL: https://issues.apache.org/jira/browse/SPARK-17734 > Project: Spark > Issue Type: Wish > Reporter: Leif Warner > Priority: Minor > > There's an existing ".join(right: Dataset[_], usingColumn: String): > DataFrame" method on Dataset. > Would appreciate it if a variant that returns typed Datasets would also > available. > If you write a join that contains the common column name name, you get an > AnalysisException thrown because that's ambiguous, e.g: > $"foo" === $"foo" > So I wrote table1.toDF()("foo") === table2.toDF()("foo"), but that's a little > error prone, and coworkers considered it a hack and didn't want to use it, > because it "mixes DataFrame and Dataset api". -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org