[ 
https://issues.apache.org/jira/browse/SPARK-9744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14679857#comment-14679857
 ] 

Sean Owen commented on SPARK-9744:
----------------------------------

join returns something different than cogroup. It returns all the combinations 
of (W1,W2) not iterables over the values individually.

Here's how to make a PR: 
https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark
But yes if there is suitable functionality in DataFrames, try it again and/or 
look at fixing it if there is a problem. 

> Add RDD method to map with lag and lead
> ---------------------------------------
>
>                 Key: SPARK-9744
>                 URL: https://issues.apache.org/jira/browse/SPARK-9744
>             Project: Spark
>          Issue Type: Wish
>          Components: Spark Core
>            Reporter: Jerry Z
>            Priority: Minor
>
> To avoid zipping with index and doing numerous mapping and joins, having a 
> single method call to map with an additional two parameters (1: list of 
> offsets [(-) for lag, 0 for current and (+) for lead])) and (2:default 
> value). The other difference to the map function takes an argument of List<T> 
> and not just T.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to