RE: DAGScheduler: Failed to run foreach

2014-06-24 Thread Sameer Tilak
Date: Mon, 23 Jun 2014 18:00:27 -0700
Subject: Re: DAGScheduler: Failed to run foreach
To: user@spark.apache.org
CC: u...@spark.incubator.apache.org
Please note that this: for (sentence <- sourcerdd) {... } is actually Scala syntactic sugar which is converted into sourcerdd.foreach { sentence
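The desugaring mentioned in this reply can be seen without Spark. The following is only a minimal sketch: `Recorder` and `ForSugarDemo` are hypothetical names standing in for the RDD, not anything from the thread. It relies on the fact that the Scala compiler rewrites a `for` loop without `yield` into a call to `foreach` on the source value, so the loop body becomes a function passed to that method.

```scala
// Hypothetical stand-in for the RDD: any type with a foreach method
// can be used on the right-hand side of a for loop.
class Recorder[A](items: Seq[A]) {
  var foreachCalls = 0

  def foreach[U](f: A => U): Unit = {
    foreachCalls += 1 // count how often the compiler-generated call reaches us
    items.foreach(f)
  }
}

object ForSugarDemo {
  // Returns (number of foreach calls, sentences seen by the loop body).
  def run(): (Int, List[String]) = {
    val source = new Recorder(Seq("first sentence", "second sentence"))
    val seen = scala.collection.mutable.ListBuffer.empty[String]

    // The compiler rewrites this loop to:
    //   source.foreach { sentence => seen += sentence }
    for (sentence <- source) {
      seen += sentence
    }

    (source.foreachCalls, seen.toList)
  }
}
```

With an RDD in place of `Recorder`, the loop body is the function that Spark has to ship to the executors, which is why a `for` loop over an RDD is reported as a `foreach` job in the log.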

DAGScheduler: Failed to run foreach

2014-06-23 Thread Sameer Tilak
()
14/06/23 16:45:04 INFO DAGScheduler: Submitting Stage 0 (MappedRDD[1] at textFile at <console>:12), which has no missing parents
14/06/23 16:45:04 INFO DAGScheduler: Failed to run foreach at CalculateScore.scala:51
org.apache.spark.SparkException: Job aborted due to stage failure: Task

RE: DAGScheduler: Failed to run foreach

2014-06-23 Thread Sameer Tilak
The subject should be: org.apache.spark.SparkException: Job aborted due to stage failure: Task not serializable: java.io.NotSerializableException: and not DAGScheduler: Failed to run foreach. If I call printScoreCanndedString with a hard-coded string and identical 2nd parameter, it works fine
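The snippet does not show which class the NotSerializableException names, so the following is only a sketch of the general mechanism, under the assumption that the function passed to foreach captures some object that is not Serializable. `Scorer` and `ClosureCheck` are hypothetical names. The check uses plain Java serialization and does not involve Spark.

```scala
import java.io.{ByteArrayOutputStream, NotSerializableException, ObjectOutputStream}

// Hypothetical helper class that does NOT extend Serializable.
class Scorer {
  def score(s: String): Int = s.length
}

object ClosureCheck {
  // Returns true if Java serialization accepts the object,
  // false if it throws NotSerializableException.
  def isSerializable(obj: AnyRef): Boolean = {
    val out = new ObjectOutputStream(new ByteArrayOutputStream())
    try { out.writeObject(obj); true }
    catch { case _: NotSerializableException => false }
    finally out.close()
  }

  // The function captures `scorer`, so serializing the function
  // also has to serialize the Scorer instance.
  def capturing(): Boolean = {
    val scorer = new Scorer
    val f: String => Int = s => scorer.score(s)
    isSerializable(f)
  }

  // The function only touches its own argument and captures nothing.
  def selfContained(): Boolean = {
    val f: String => Int = s => s.length
    isSerializable(f)
  }
}
```

Common ways around a captured object of this kind are to make its class extend Serializable or to construct the object inside the function body, so that it is created where the function runs rather than shipped with it.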

Re: DAGScheduler: Failed to run foreach

2014-06-23 Thread Aaron Davidson
: java.io.NotSerializableException: and not DAGScheduler: Failed to run foreach. If I call printScoreCanndedString with a hard-coded string and identical 2nd parameter, it works fine. However, for my application that is not sufficient.
-- From: ssti...@live.com To: u