holdenk created SPARK-25360: ------------------------------- Summary: Parallelized RDDs of Ranges could have known partitioner Key: SPARK-25360 URL: https://issues.apache.org/jira/browse/SPARK-25360 Project: Spark Issue Type: Improvement Components: Spark Core Affects Versions: 2.4.0 Reporter: holdenk
We already have the logic to split up the generator, we could expose the same logic as a partitioner. This would be useful when joining a small parallelized collection with a larger collection and other cases. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org