Github user tgravescs commented on the issue:

    https://github.com/apache/spark/pull/22112
  
    Personally I don't want to talk about implementation until we decide what 
we want our semantics to be around the unordered operations because that 
affects any implementation. 
    If we are saying we need to fix zip and any other unordered operation that 
means we don't really support unordered operations and everything needs to be 
sorted. 
    
    I would propose we fix the things that are using the round robin type 
partitioning (repartition) but then unordered things like zip/MapPartitions 
(via user code) we document or perhaps give the user the option to sort.  
    
    @mridulm  you caught the issues with zip and others and have said they need 
to be fixed, what are your thoughts?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to