Nicholas Chammas created SPARK-16427: ----------------------------------------
Summary: Expand documentation on the various RDD storage levels Key: SPARK-16427 URL: https://issues.apache.org/jira/browse/SPARK-16427 Project: Spark Issue Type: Improvement Components: Documentation Reporter: Nicholas Chammas Priority: Minor Looking at the docs here http://spark.apache.org/docs/1.6.2/api/python/pyspark.html#pyspark.StorageLevel A newcomer to Spark won’t understand the meaning of {{_2}}, or the meaning of {{_SER}} (or its value), and won’t understand how exactly memory and disk play together when something like {{MEMORY_AND_DISK}} is selected. We should expand this documentation to explain what the various levels mean and perhaps even when a user might want to use them. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org