I get the key point . The problem is in sc.sequenceFile,From API description "RDD will create many references to the same objecty" ,So I revise the code "sessions.getBytes" to "sessions.getBytes.clone", It seems to work. Thanks.
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/toArray-first-get-the-different-result-from-one-element-RDD-tp20734p20739.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org