[ 
https://issues.apache.org/jira/browse/SPARK-8319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-8319:
-----------------------------------

    Assignee: Josh Rosen  (was: Apache Spark)

> Update logic related to key ordering in shuffle dependencies
> ------------------------------------------------------------
>
>                 Key: SPARK-8319
>                 URL: https://issues.apache.org/jira/browse/SPARK-8319
>             Project: Spark
>          Issue Type: Improvement
>          Components: Shuffle, SQL
>            Reporter: Josh Rosen
>            Assignee: Josh Rosen
>
> The Tungsten ShuffleManager falls back to regular SortShuffleManager whenever 
> the shuffle dependency specifies a key ordering, but technically we only need 
> to fall back when an aggregator is also specified.  We should update the 
> fallback logic to handle this case so that the Tungsten optimizations can 
> apply to more workloads.
> I also noticed that the SQL Exchange operator performs defensive copying of 
> shuffle inputs when a key ordering is specified, but this is unnecessary: the 
> only shuffle manager that performs sorting on the map side is 
> SortShuffleManager, and it only performs sorting if an aggregator is 
> specified.  SQL never uses Spark's shuffle for performing aggregation, so 
> this copying is unnecessary.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to