[ 
https://issues.apache.org/jira/browse/SPARK-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14060677#comment-14060677
 ] 

Sean Owen commented on SPARK-2278:
----------------------------------

Where are keys replicated? I don't get that point. Keys can be what you like, 
as can values. Values don't need to contain keys to sortByKey for example. You 
can sort by fields of a composite key without copying anything. A function like 
"employee => employee.name" does not copy the value employee in order to access 
the name for sorting purposes.

groupBy does not (necessarily) operate on key-value pair data. You can easily 
group any value by some function of the value. groupByKey does of course.


> groupBy & groupByKey should support custom comparator
> -----------------------------------------------------
>
>                 Key: SPARK-2278
>                 URL: https://issues.apache.org/jira/browse/SPARK-2278
>             Project: Spark
>          Issue Type: New Feature
>          Components: Java API
>    Affects Versions: 1.0.0
>            Reporter: Hans Uhlig
>
> To maintain parity with MapReduce you should be able to specify a custom key 
> equality function in groupBy/groupByKey similar to sortByKey. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to