Kevin Mader created SPARK-4698: ---------------------------------- Summary: Data-locality aware Partitioners Key: SPARK-4698 URL: https://issues.apache.org/jira/browse/SPARK-4698 Project: Spark Issue Type: Improvement Components: Spark Core Reporter: Kevin Mader Priority: Minor
The current hash and range partitioner tools do not seem to respect the existing data-locality. A 'dictionary' driven partitioner that calculated the partitions based on the existing key locations instead of re-calculating them would be ideal. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org