Github user Tagar commented on the pull request: https://github.com/apache/spark/pull/1082#issuecomment-113381555 Would be gread to have this implemented in PySpark as well. Very handy in setups like Jupyter where we have a lot of RDDs declared in a Spark Notebook, and its hard to tell where is memory consumed. UnpersistAll isn't really a solution, as if we rerun all the cells, we're back to square one.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org