Thank you very much for your kind help. I have some further questions:

   - If the RDD is stored in serialized format, does that mean that whenever
the RDD is processed, it is unpacked from the JVM and then packed back into
it, even if they are located on the same machine? 
<http://apache-spark-user-list.1001560.n3.nabble.com/file/n13862/rdd_img.png> 

   - Can the RDD be partially unpacked from its serialized state? Or must it
be fully unpacked every time it is touched, and of course packed again
afterward?

   - When an RDD is cached, is it saved in unserialized or serialized
format? If it is saved in unserialized format, is it possible to partially
read the RDD from the JVM into the Python runtime? 
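
To make concrete what I mean by "partially unpacked" in the second question,
here is a small sketch in plain Python. It uses only the standard pickle
module and is not Spark's actual serializer; the names pack_in_batches and
iter_unpacked are made up for illustration. The idea is that if the data were
packed in independent batches, a reader could unpack just the batches it
needs instead of the whole dataset.

```python
import pickle


def pack_in_batches(records, batch_size):
    """Serialize records as a list of independently pickled batches.

    Hypothetical helper for illustration only, not a Spark API.
    """
    return [
        pickle.dumps(records[i:i + batch_size])
        for i in range(0, len(records), batch_size)
    ]


def iter_unpacked(batches):
    """Yield records lazily, unpickling only one batch at a time."""
    for blob in batches:
        for record in pickle.loads(blob):
            yield record


# Ten records packed in batches of four gives three serialized blobs.
batches = pack_in_batches(list(range(10)), batch_size=4)

# Read only the first three records; because iter_unpacked is a generator,
# the second and third blobs are never unpickled.
first_three = []
for record in iter_unpacked(batches):
    first_three.append(record)
    if len(first_three) == 3:
        break
```

Whether Spark does something like this when moving data between the JVM and
Python is exactly what I am asking.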

Thank you very much


