[ 
https://issues.apache.org/jira/browse/SPARK-17734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15541870#comment-15541870
 ] 

Dongjoon Hyun commented on SPARK-17734:
---------------------------------------

Hi, [~pdxleif].
How do you think about the following?
```
table1.join(table2, table1.col("value") === table2.col("value"))
```

> inner equi-join shorthand that returns Datasets, like DataFrame already has
> ---------------------------------------------------------------------------
>
>                 Key: SPARK-17734
>                 URL: https://issues.apache.org/jira/browse/SPARK-17734
>             Project: Spark
>          Issue Type: Wish
>            Reporter: Leif Warner
>            Priority: Minor
>
> There's an existing ".join(right: Dataset[_], usingColumn: String): 
> DataFrame" method on Dataset.
> Would appreciate it if a variant that returns typed Datasets would also 
> available.
> If you write a join that contains the common column name name, you get an 
> AnalysisException thrown because that's ambiguous, e.g:
> $"foo" === $"foo"
> So I wrote table1.toDF()("foo") === table2.toDF()("foo"), but that's a little 
> error prone, and coworkers considered it a hack and didn't want to use it, 
> because it "mixes DataFrame and Dataset api".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to