Thank you very much for your kindly help. I rise some another questions: - If the RDD is stored in serialized format, is that means that whenever the RDD is processed, it will be unpacked and packed again from and back to the JVM even they are located on the same machine? <http://apache-spark-user-list.1001560.n3.nabble.com/file/n13862/rdd_img.png>
- Can the RDD be partially unpacked from the serialized state? or when every a RDD is touched, it must be fully unpacked, and of course pack again afterword. - When a RDD is cached, is it saved in a unserialized format or serialized format? If it's saved in a unserialized format, is the partially reading of RDD from JVM to PYTHON runtime possible? Thank you very much -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/RDD-memory-questions-tp13805p13862.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org