[ 
https://issues.apache.org/jira/browse/SPARK-17734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15546303#comment-15546303
 ] 

Leif Warner commented on SPARK-17734:
-------------------------------------

That works. I was just hoping for a helper method that does essentially that, 
similar to how you can currently write `table1.join(table2, "value")` and get a 
DataFrame. It would be more concise and less error-prone.

> inner equi-join shorthand that returns Datasets, like DataFrame already has
> ---------------------------------------------------------------------------
>
>                 Key: SPARK-17734
>                 URL: https://issues.apache.org/jira/browse/SPARK-17734
>             Project: Spark
>          Issue Type: Wish
>            Reporter: Leif Warner
>            Priority: Minor
>
> There's an existing ".join(right: Dataset[_], usingColumn: String): 
> DataFrame" method on Dataset.
> I would appreciate it if a variant that returns typed Datasets were also 
> available.
> If you write a join condition that contains the common column name, you get an 
> AnalysisException thrown because that's ambiguous, e.g.:
> $"foo" === $"foo"
> So I wrote table1.toDF()("foo") === table2.toDF()("foo"), but that's a little 
> error-prone, and coworkers considered it a hack and didn't want to use it 
> because it "mixes DataFrame and Dataset api".
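
For illustration only, here is a minimal sketch of what such a typed helper might look like when built on the existing `joinWith` method. The helper name `joinOn`, the object `TypedEquiJoin`, and the record types `Rec1` and `Rec2` are hypothetical and not part of Spark or of this issue. The sketch assumes that aliasing both sides is enough to disambiguate the shared column name, and it returns pairs `(T, U)` rather than a merged row.

```scala
import org.apache.spark.sql.{Dataset, SparkSession}
import org.apache.spark.sql.functions.col

// Hypothetical record types that share the column "foo".
case class Rec1(foo: Int, a: String)
case class Rec2(foo: Int, b: String)

object TypedEquiJoin {
  // Hypothetical helper, not a Spark API: inner equi-join on one shared
  // column name that returns a typed Dataset of pairs, not a DataFrame.
  def joinOn[T, U](left: Dataset[T], right: Dataset[U], column: String): Dataset[(T, U)] =
    // Alias each side so the two same-named columns can be told apart.
    left.as("l").joinWith(right.as("r"), col(s"l.$column") === col(s"r.$column"), "inner")

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("typed-equi-join-sketch")
      .getOrCreate()
    import spark.implicits._

    val table1 = Seq(Rec1(1, "x"), Rec1(2, "y")).toDS()
    val table2 = Seq(Rec2(2, "p"), Rec2(3, "q")).toDS()

    // Only foo = 2 appears in both inputs, so one pair is expected.
    val joined: Dataset[(Rec1, Rec2)] = joinOn(table1, table2, "foo")
    joined.collect().foreach(println)

    spark.stop()
  }
}
```

Because the result is a Dataset of tuples, both original objects stay typed. A built-in variant could choose a different result shape, so this is one possible design rather than the proposed API.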



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

