Emanuele Cesena created SPARK-8827: -------------------------------------- Summary: pyspark.DStream top method Key: SPARK-8827 URL: https://issues.apache.org/jira/browse/SPARK-8827 Project: Spark Issue Type: Improvement Components: PySpark, Streaming Reporter: Emanuele Cesena Priority: Minor
Is there a reason for not having DStream.top? Any issues with the following definition? ``` def topPartition(partition): return sorted(partition, key=lambda p: p[1], reverse=True)[:10] def top(counts): return counts.transform(lambda rdd: rdd.mapPartitions(topPartition) .sortBy(lambda p: p[1],ascending=False)) ``` -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org