when i try to open sequence file: val t2 = sc.sequenceFile("/user/hdfs/e1Mseq", classOf[String], classOf[String]) t2.groupByKey().take(5)
I get: org.apache.spark.SparkException: Job aborted: Task 25.0:0 had a not serializable result: java.io.NotSerializableException: org.apache.hadoop.io.Text another thing is: t2.take(5) - returns 5 identical items, i guess I have to map/clone items, but i get something like org.apache.hadoop.io.Text cannot be cast to java.lang.String, how do i clone it? Thanks. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/sequenceFile-and-groupByKey-tp2428.html Sent from the Apache Spark User List mailing list archive at Nabble.com.