[ https://issues.apache.org/jira/browse/SPARK-2871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14091986#comment-14091986 ]
Josh Rosen commented on SPARK-2871:
-----------------------------------

There's actually an open PR for this that's currently being reviewed (odd that it wasn't automatically linked):

https://github.com/apache/spark/pull/1791

> Missing API in PySpark
> ----------------------
>
>                 Key: SPARK-2871
>                 URL: https://issues.apache.org/jira/browse/SPARK-2871
>             Project: Spark
>          Issue Type: Improvement
>            Reporter: Davies Liu
>
> There are several APIs missing in PySpark:
>   RDD.collectPartitions()
>   RDD.histogram()
>   RDD.zipWithIndex()
>   RDD.zipWithUniqueId()
>   RDD.min(comp)
>   RDD.max(comp)
> A bunch of API related to approximate jobs.
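
For readers unfamiliar with two of the methods listed in the issue, here is a minimal sketch of the semantics that zipWithIndex() and zipWithUniqueId() have in the Scala RDD API. It is not the code from the linked PR and does not use Spark at all: partitions are modeled as a plain list of lists, and the function names zip_with_index and zip_with_unique_id are hypothetical names chosen for illustration.

```python
# Illustrative only: an "RDD" is modeled as a list of partitions (a list of
# lists). These helper names are assumptions, not PySpark functions.


def zip_with_index(partitions):
    """Pair each item with a consecutive index, ordered by partition, then position."""
    result = []
    offset = 0  # number of items seen in earlier partitions
    for part in partitions:
        result.append([(item, offset + i) for i, item in enumerate(part)])
        offset += len(part)
    return result


def zip_with_unique_id(partitions):
    """Pair the i-th item of partition k with the id i * n + k (n = partition count)."""
    n = len(partitions)
    return [
        [(item, i * n + k) for i, item in enumerate(part)]
        for k, part in enumerate(partitions)
    ]


parts = [["a", "b"], ["c"], ["d", "e", "f"]]
print(zip_with_index(parts))
print(zip_with_unique_id(parts))
```

In this model, zip_with_index needs the sizes of all earlier partitions before it can number a partition, while zip_with_unique_id computes each id from the partition number and position alone, at the cost of ids that are unique but not consecutive.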