
Could you give the version of Java that you are building with and the
version of Java you are running with? Are they the same?

Just off the cuff, I wonder if this is related to:

If it is, it could appear that certain classes are not in the jar
because they lie beyond the extended zip boundary, so `jar tvf` won't
list them.
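
One rough way to test that theory is to count the entries in the assembly jar and look up a class by its entry name with a tool that reads Zip64 archives. Below is a minimal sketch, assuming Python 3 is available; the function name `inspect_jar` and the constant name are made up for illustration and are not part of Spark or the JDK. It relies on Python's standard `zipfile` module, which can read Zip64 archives.

```python
import zipfile

# Largest entry count a classic (non-Zip64) zip archive can record.
CLASSIC_ZIP_MAX_ENTRIES = 65535


def inspect_jar(jar_path, entry_name):
    """Return (entry_count, exceeds_classic_limit, entry_present) for a jar.

    entry_name is the path inside the archive, for example
    'org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$1.class'.
    """
    with zipfile.ZipFile(jar_path) as jar:
        names = set(jar.namelist())
    count = len(names)
    return count, count > CLASSIC_ZIP_MAX_ENTRIES, entry_name in names
```

For example, calling `inspect_jar("spark-assembly-1.0.0-hadoop1.0.4.jar", "org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$1.class")` on the downloaded assembly would show whether the class is present and whether the jar holds more entries than the classic format allows. If the class is present here but missing from the `jar tvf` listing, that would point at the tool rather than the build.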

- Patrick

On Sun, Jun 8, 2014 at 12:45 PM, Paul Brown wrote:
> Moving over to the dev list, as this isn't a user-scope issue.
> I just ran into this issue with the missing saveAsTextFile, and here's a
> little additional information:
> - Code ported from 0.9.1 up to 1.0.0; works with local[n] in both cases.
> - Driver built as an uberjar via Maven.
> - Deployed to smallish EC2 cluster in standalone mode (S3 storage) with
> Spark 1.0.0-hadoop1 downloaded from Apache.
> Given that it functions correctly in local mode but not in a standalone
> cluster, this suggests to me that the issue is in a difference between the
> Maven version and the hadoop1 version.
> In the spirit of taking the computer at its word, we can just have a look
> in the JAR files.  Here's what's in the Maven dep as of 1.0.0:
> jar tvf ~/.m2/repository/org/apache/spark/spark-core_2.10/1.0.0/spark-core_2.10-1.0.0.jar | grep 'rdd/RDD' | grep 'saveAs'
>   1519 Mon May 26 13:57:58 PDT 2014 org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$1.class
>   1560 Mon May 26 13:57:58 PDT 2014 org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$2.class
> And here's what's in the hadoop1 distribution:
> jar tvf spark-assembly-1.0.0-hadoop1.0.4.jar | grep 'rdd/RDD' | grep 'saveAs'
> I.e., it's not there.  It is in the hadoop2 distribution:
> jar tvf spark-assembly-1.0.0-hadoop2.2.0.jar | grep 'rdd/RDD' | grep 'saveAs'
>   1519 Mon May 26 07:29:54 PDT 2014 org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$1.class
>   1560 Mon May 26 07:29:54 PDT 2014 org/apache/spark/rdd/RDD$$anonfun$saveAsTextFile$2.class
> So something's clearly broken with the way that the distribution assemblies
> are created.
> FWIW and IMHO, the "right" way to publish the hadoop1 and hadoop2 flavors
> of Spark to Maven Central would be as *entirely different* artifacts
> (spark-core-h1, spark-core-h2).
> Logged as SPARK-2075.
> Cheers.
> -- Paul
> --
> | Multifarious, Inc. |
> On Fri, Jun 6, 2014 at 2:45 AM, HenriV wrote:
>> I'm experiencing the same error while upgrading from 0.9.1 to 1.0.0.
>> I'm using Google Compute Engine and Cloud Storage, but saveAsTextFile
>> returns errors whether I save to the cloud or locally. When I start a
>> job in the cluster I do get an error, but after this error it keeps on
>> running fine until the saveAsTextFile. (I don't know if the two are
>> connected.)
>> -----------Error at job startup-------
>> ERROR metrics.MetricsSystem: Sink class org.apache.spark.metrics.sink.MetricsServlet cannot be instantialized
>> java.lang.reflect.InvocationTargetException
>>         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>         at sun.reflect.NativeConstructorAccessorImpl.newInstance(
>>         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(
>>         at java.lang.reflect.Constructor.newInstance(
>>         at org.apache.spark.metrics.MetricsSystem$$anonfun$registerSinks$1.apply(MetricsSystem.scala:136)
>>         at org.apache.spark.metrics.MetricsSystem$$anonfun$registerSinks$1.apply(MetricsSystem.scala:130)
>>         at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
>>         at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
>>         at scala.collection.mutable.HashTable$class.foreachEntry(HashTable.scala:226)
>>         at scala.collection.mutable.HashMap.foreachEntry(HashMap.scala:39)
>>         at scala.collection.mutable.HashMap.foreach(HashMap.scala:98)
>>         at org.apache.spark.metrics.MetricsSystem.registerSinks(MetricsSystem.scala:130)
>>         at org.apache.spark.metrics.MetricsSystem.<init>(MetricsSystem.scala:84)
>>         at org.apache.spark.metrics.MetricsSystem$.createMetricsSystem(MetricsSystem.scala:167)
>>         at org.apache.spark.SparkEnv$.create(SparkEnv.scala:230)
>>         at org.apache.spark.SparkContext.<init>(SparkContext.scala:202)
>>         at Hello$.main(Hello.scala:101)
>>         at Hello.main(Hello.scala)
>>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>         at sun.reflect.NativeMethodAccessorImpl.invoke(
>>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(
>>         at java.lang.reflect.Method.invoke(
>>         at sbt.Run.invokeMain(Run.scala:72)
>>         at sbt.Run.run0(Run.scala:65)
>>         at sbt.Run.sbt$Run$$execute$1(Run.scala:54)
>>         at sbt.Run$$anonfun$run$1.apply$mcV$sp(Run.scala:58)
>>         at sbt.Run$$anonfun$run$1.apply(Run.scala:58)
>>         at sbt.Run$$anonfun$run$1.apply(Run.scala:58)
>>         at sbt.Logger$$anon$4.apply(Logger.scala:90)
>>         at sbt.TrapExit$
>>         at
>> Caused by: java.lang.NoSuchMethodError: com.fasterxml.jackson.core.JsonFactory.requiresPropertyOrdering()Z
>>         at com.fasterxml.jackson.databind.ObjectMapper.<init>(
>>         at com.fasterxml.jackson.databind.ObjectMapper.<init>(
>>         at org.apache.spark.metrics.sink.MetricsServlet.<init>(MetricsServlet.scala:45)
>>         ... 31 more
>> Then it runs fine until I get to saveAsTextFile:
>> 14/06/06 09:05:12 INFO scheduler.TaskSetManager: Loss was due to java.lang.ClassNotFoundException: org.apache.spark.rdd.RDD$$anonfun$saveAsTextFile$1 [duplicate 17]
>> 14/06/06 09:05:12 INFO scheduler.DAGScheduler: Failed to run saveAsTextFile at Hello.scala:123
>> 14/06/06 09:05:12 INFO scheduler.TaskSchedulerImpl: Cancelling stage 0
>> [error] (run-main-0) org.apache.spark.SparkException: Job aborted due to stage failure: Task 0.0:3 failed 4 times, most recent failure: Exception failure in TID 142 on host sparky-s1.c.quick-heaven-560.internal: java.lang.ClassNotFoundException: org.apache.spark.rdd.RDD$$anonfun$saveAsTextFile$1
>> [error]$
>> [error]$
>> [error] Method)
>> [error]
>> [error]         java.lang.ClassLoader.loadClass(
>> [error]         java.lang.ClassLoader.loadClass(
>> [error]         java.lang.Class.forName0(Native Method)
>> [error]         java.lang.Class.forName(
>> [error]
>> org.apache.spark.serializer.JavaDeserializationStream$$anon$1.resolveClass(JavaSerializer.scala:60)
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> org.apache.spark.serializer.JavaDeserializationStream.readObject(JavaSerializer.scala:63)
>> [error]
>> org.apache.spark.scheduler.ResultTask$.deserializeInfo(ResultTask.scala:61)
>> [error]
>> org.apache.spark.scheduler.ResultTask.readExternal(ResultTask.scala:141)
>> [error]
>> [error]
>> [error]
>> [error]
>> [error]
>> org.apache.spark.serializer.JavaDeserializationStream.readObject(JavaSerializer.scala:63)
>> [error]
>> org.apache.spark.serializer.JavaSerializerInstance.deserialize(JavaSerializer.scala:85)
>> [error]
>> org.apache.spark.executor.Executor$
>> [error]
>> java.util.concurrent.ThreadPoolExecutor.runWorker(
>> [error]
>> java.util.concurrent.ThreadPoolExecutor$
>> [error]
>> Thanks for any help or guidance.
