when i try to open sequence file:
val t2 = sc.sequenceFile("/user/hdfs/e1Mseq", classOf[String],
classOf[String])
t2.groupByKey().take(5)

I get:
org.apache.spark.SparkException: Job aborted: Task 25.0:0 had a not
serializable result: java.io.NotSerializableException:
org.apache.hadoop.io.Text

another thing is:
t2.take(5) - returns 5 identical items, i guess I have to map/clone items,
but i get something like org.apache.hadoop.io.Text cannot be cast to
java.lang.String, how do i clone it?

Thanks.



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/sequenceFile-and-groupByKey-tp2428.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Reply via email to