[ https://issues.apache.org/jira/browse/SPARK-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Patrick Wendell updated SPARK-911: ---------------------------------- Description: If someone has sorted a (K, V) rdd, we should offer them a way to filter a range of the partitions that employs map pruning. This would be simple using a small range index within the rdd itself. A good example is I sort my dataset by time and then I want to serve queries that are restricted to a certain time range. (was: \[Tentatively assigned to me, but anyone can do this if they'd like!\] If someone has sorted a (K, V) rdd, we should offer them a way to filter a range of the partitions that employs map pruning. This would be simple using a small range index within the rdd itself. A good example is I sort my dataset by time and then I want to serve queries that are restricted to a certain time range.) > Support map pruning on sorted (K, V) RDD's > ------------------------------------------ > > Key: SPARK-911 > URL: https://issues.apache.org/jira/browse/SPARK-911 > Project: Spark > Issue Type: Bug > Reporter: Patrick Wendell > > If someone has sorted a (K, V) rdd, we should offer them a way to filter a > range of the partitions that employs map pruning. This would be simple using > a small range index within the rdd itself. A good example is I sort my > dataset by time and then I want to serve queries that are restricted to a > certain time range. -- This message was sent by Atlassian JIRA (v6.2#6252)