[ https://issues.apache.org/jira/browse/SPARK-17734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15546303#comment-15546303 ]
Leif Warner commented on SPARK-17734: ------------------------------------- That works, I was just hoping for a helper method that does essentially that, similar to how you can say `table1.join(table2, "value")` right now and get a DataFrame. Just to be more concise and less error-prone. > inner equi-join shorthand that returns Datasets, like DataFrame already has > --------------------------------------------------------------------------- > > Key: SPARK-17734 > URL: https://issues.apache.org/jira/browse/SPARK-17734 > Project: Spark > Issue Type: Wish > Reporter: Leif Warner > Priority: Minor > > There's an existing ".join(right: Dataset[_], usingColumn: String): > DataFrame" method on Dataset. > Would appreciate it if a variant that returns typed Datasets would also > available. > If you write a join that contains the common column name name, you get an > AnalysisException thrown because that's ambiguous, e.g: > $"foo" === $"foo" > So I wrote table1.toDF()("foo") === table2.toDF()("foo"), but that's a little > error prone, and coworkers considered it a hack and didn't want to use it, > because it "mixes DataFrame and Dataset api". -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org