Date: Mon, 23 Jun 2014 18:00:27 -0700
Subject: Re: DAGScheduler: Failed to run foreach
To: user@spark.apache.org
CC: u...@spark.incubator.apache.org
Please note that this:
for (sentence - sourcerdd) {... }
is actually Scala syntactic sugar which is converted into
sourcerdd.foreach { sentence
()14/06/23 16:45:04 INFO DAGScheduler: Submitting Stage 0
(MappedRDD[1] at textFile at console:12), which has no missing
parents14/06/23 16:45:04 INFO DAGScheduler: Failed to run foreach at
CalculateScore.scala:51org.apache.spark.SparkException: Job aborted due to
stage failure: Task
The subject should be: org.apache.spark.SparkException: Job aborted due to
stage failure: Task not serializable: java.io.NotSerializableException: and
not DAGScheduler: Failed to run foreach
If I call printScoreCanndedString with a hard-coded string and identical 2nd
parameter, it works fine
: java.io.NotSerializableException:
and not DAGScheduler: Failed to run foreach
If I call printScoreCanndedString with a hard-coded string and identical
2nd parameter, it works fine. However for my application that is not
sufficient.
--
From: ssti...@live.com
To: u