I'll kick this vote off with a +1.

On Thu, Jan 16, 2014 at 10:43 AM, Patrick Wendell <pwend...@gmail.com> wrote:
> I also ran your example locally and it worked with 0.8.1 and
> 0.9.0-rc1. So it's possible somehow you are pulling in an older
> version of Spark or an incompatible version of Hadoop.
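>
> A quick way to check what is actually being resolved is something like
> "mvn dependency:tree -Dincludes=org.apache.hadoop" in your project: if
> that shows a Hadoop 1.x client while you are targeting YARN 2.2, that
> mismatch alone would explain the IncompatibleClassChangeError below.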
>
> - Patrick
>
> On Thu, Jan 16, 2014 at 9:39 AM, Patrick Wendell <pwend...@gmail.com> wrote:
>> Hey Alex,
>>
>> Thanks for testing out this rc. Would you mind forking this into a different
>> thread so we can discuss it there?
>>
>> Also, does your application build and run correctly with spark 0.8.1? That
>> would determine whether the problem is specifically with this rc...
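>>
>> (If you try that, note that 0.8.1 was published for Scala 2.9.3, so the
>> spark-core dependency would need both the version and the artifact
>> suffix changed -- something along these lines, untested:
>>
>>   <dependency>
>>     <groupId>org.apache.spark</groupId>
>>     <artifactId>spark-core_2.9.3</artifactId>
>>     <version>0.8.1-incubating</version>
>>   </dependency>
>>
>> with the project's own scala.version dropped to match.)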
>>
>> Patrick
>>
>> ---
>> sent from my phone
>>
>> On Jan 15, 2014 11:44 PM, "Alex Cozzi" <alexco...@gmail.com> wrote:
>>>
>>> Oh, I forgot: I am using the “yarn” Maven profile to target YARN 2.2.
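>>>
>>> One thing I have not tried yet: since the published spark-core seems to
>>> depend on a Hadoop 1.x client by default, maybe my own pom also needs a
>>> client matching the cluster, something like (version assumed, untested):
>>>
>>>   <dependency>
>>>     <groupId>org.apache.hadoop</groupId>
>>>     <artifactId>hadoop-client</artifactId>
>>>     <version>2.2.0</version>
>>>   </dependency>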
>>>
>>> Alex Cozzi
>>> alexco...@gmail.com
>>> On Jan 15, 2014, at 11:41 PM, Alex Cozzi <alexco...@gmail.com> wrote:
>>>
>>> > Just testing out rc1. I created a dependent project (using Maven) and
>>> > copied the HdfsTest.scala example, adding a single line to save the file
>>> > back to disk:
>>> >
>>> > package org.apache.spark.examples
>>> >
>>> > import org.apache.spark._
>>> >
>>> > object HdfsTest {
>>> >   def main(args: Array[String]) {
>>> >     val sc = new SparkContext(args(0), "HdfsTest",
>>> >       System.getenv("SPARK_HOME"), SparkContext.jarOfClass(this.getClass))
>>> >     val file = sc.textFile(args(1))
>>> >     val mapped = file.map(s => s.length).cache()
>>> >     for (iter <- 1 to 10) {
>>> >       val start = System.currentTimeMillis()
>>> >       // Force evaluation of the cached RDD.
>>> >       for (x <- mapped) { x + 2 }
>>> >       // println("Processing: " + x)
>>> >       val end = System.currentTimeMillis()
>>> >       println("Iteration " + iter + " took " + (end - start) + " ms")
>>> >       // The line I added: write the mapped RDD back to disk. (Reusing the
>>> >       // same "out" directory would fail on iteration 2 anyway, but the
>>> >       // error below happens on the very first write.)
>>> >       mapped.saveAsTextFile("out")
>>> >     }
>>> >     System.exit(0)
>>> >   }
>>> > }
>>> >
>>> > and this is my pom file:
>>> > <project xmlns="http://maven.apache.org/POM/4.0.0"
>>> >          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
>>> >          xsi:schemaLocation="http://maven.apache.org/POM/4.0.0
>>> >                              http://maven.apache.org/maven-v4_0_0.xsd">
>>> >   <modelVersion>4.0.0</modelVersion>
>>> >   <groupId>my.examples</groupId>
>>> >   <artifactId>spark-samples</artifactId>
>>> >   <version>0.0.1-SNAPSHOT</version>
>>> >   <inceptionYear>2014</inceptionYear>
>>> >
>>> >   <properties>
>>> >     <maven.compiler.source>1.6</maven.compiler.source>
>>> >     <maven.compiler.target>1.6</maven.compiler.target>
>>> >     <encoding>UTF-8</encoding>
>>> >     <scala.tools.version>2.10</scala.tools.version>
>>> >     <scala.version>2.10.0</scala.version>
>>> >   </properties>
>>> >
>>> >   <repositories>
>>> >     <repository>
>>> >       <id>spark-staging</id>
>>> >       <url>https://repository.apache.org/content/repositories/orgapachespark-1001</url>
>>> >     </repository>
>>> >   </repositories>
>>> >
>>> >   <dependencies>
>>> >     <dependency>
>>> >       <groupId>org.scala-lang</groupId>
>>> >       <artifactId>scala-library</artifactId>
>>> >       <version>${scala.version}</version>
>>> >     </dependency>
>>> >
>>> >     <dependency>
>>> >       <groupId>org.apache.spark</groupId>
>>> >       <artifactId>spark-core_${scala.tools.version}</artifactId>
>>> >       <version>0.9.0-incubating</version>
>>> >     </dependency>
>>> >
>>> >     <!-- Test -->
>>> >     <dependency>
>>> >       <groupId>junit</groupId>
>>> >       <artifactId>junit</artifactId>
>>> >       <version>4.11</version>
>>> >       <scope>test</scope>
>>> >     </dependency>
>>> >     <dependency>
>>> >       <groupId>org.specs2</groupId>
>>> >       <artifactId>specs2_${scala.tools.version}</artifactId>
>>> >       <version>1.13</version>
>>> >       <scope>test</scope>
>>> >     </dependency>
>>> >     <dependency>
>>> >       <groupId>org.scalatest</groupId>
>>> >       <artifactId>scalatest_${scala.tools.version}</artifactId>
>>> >       <version>2.0.M6-SNAP8</version>
>>> >       <scope>test</scope>
>>> >     </dependency>
>>> >   </dependencies>
>>> >
>>> >   <build>
>>> >     <sourceDirectory>src/main/scala</sourceDirectory>
>>> >     <testSourceDirectory>src/test/scala</testSourceDirectory>
>>> >     <plugins>
>>> >       <plugin>
>>> >         <!-- see http://davidb.github.com/scala-maven-plugin -->
>>> >         <groupId>net.alchim31.maven</groupId>
>>> >         <artifactId>scala-maven-plugin</artifactId>
>>> >         <version>3.1.6</version>
>>> >         <configuration>
>>> >           <scalaCompatVersion>2.10</scalaCompatVersion>
>>> >           <jvmArgs>
>>> >             <jvmArg>-Xms128m</jvmArg>
>>> >             <jvmArg>-Xmx2048m</jvmArg>
>>> >           </jvmArgs>
>>> >         </configuration>
>>> >         <executions>
>>> >           <execution>
>>> >             <goals>
>>> >               <goal>compile</goal>
>>> >               <goal>testCompile</goal>
>>> >             </goals>
>>> >             <configuration>
>>> >               <args>
>>> >                 <arg>-make:transitive</arg>
>>> >                 <arg>-dependencyfile</arg>
>>> >                 <arg>${project.build.directory}/.scala_dependencies</arg>
>>> >               </args>
>>> >             </configuration>
>>> >           </execution>
>>> >         </executions>
>>> >       </plugin>
>>> >       <plugin>
>>> >         <groupId>org.apache.maven.plugins</groupId>
>>> >         <artifactId>maven-surefire-plugin</artifactId>
>>> >         <version>2.13</version>
>>> >         <configuration>
>>> >           <useFile>false</useFile>
>>> >           <disableXmlReport>true</disableXmlReport>
>>> >           <!-- If you have classpath issues like NoClassDefFoundError, ... -->
>>> >           <!-- <useManifestOnlyJar>false</useManifestOnlyJar> -->
>>> >           <includes>
>>> >             <include>**/*Test.*</include>
>>> >             <include>**/*Suite.*</include>
>>> >           </includes>
>>> >         </configuration>
>>> >       </plugin>
>>> >       <plugin>
>>> >         <groupId>org.codehaus.mojo</groupId>
>>> >         <artifactId>exec-maven-plugin</artifactId>
>>> >         <version>1.2.1</version>
>>> >         <executions>
>>> >           <execution>
>>> >             <goals>
>>> >               <goal>exec</goal>
>>> >             </goals>
>>> >           </execution>
>>> >         </executions>
>>> >         <configuration>
>>> >           <mainClass>org.apache.spark.examples.HdfsTest</mainClass>
>>> >           <arguments>
>>> >             <argument>local</argument>
>>> >             <argument>pom.xml</argument>
>>> >           </arguments>
>>> >         </configuration>
>>> >       </plugin>
>>> >     </plugins>
>>> >   </build>
>>> > </project>
>>> >
>>> >
>>> > Now, when I run it, either in Eclipse or with "mvn exec:java", I get the
>>> > following error:
>>> > [INFO]
>>> > [INFO] --- exec-maven-plugin:1.2.1:java (default-cli) @ spark-samples
>>> > ---
>>> > SLF4J: Class path contains multiple SLF4J bindings.
>>> > SLF4J: Found binding in
>>> > [jar:file:/Users/acozzi/.m2/repository/org/slf4j/slf4j-log4j12/1.6.1/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
>>> > SLF4J: Found binding in
>>> > [jar:file:/Users/acozzi/.m2/repository/org/slf4j/slf4j-simple/1.6.1/slf4j-simple-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
>>> > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
>>> > explanation.
>>> > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
>>> > 14/01/15 23:37:57 INFO slf4j.Slf4jLogger: Slf4jLogger started
>>> > 14/01/15 23:37:57 INFO Remoting: Starting remoting
>>> > 14/01/15 23:37:57 INFO Remoting: Remoting started; listening on
>>> > addresses :[akka.tcp://spark@10.0.1.10:53682]
>>> > 14/01/15 23:37:57 INFO Remoting: Remoting now listens on addresses:
>>> > [akka.tcp://spark@10.0.1.10:53682]
>>> > 14/01/15 23:37:57 INFO spark.SparkEnv: Registering BlockManagerMaster
>>> > 14/01/15 23:37:57 INFO storage.DiskBlockManager: Created local directory
>>> > at
>>> > /var/folders/mm/4qxz27w91p96v2zp5f9ncmqm38ychm/T/spark-local-20140115233757-7a41
>>> > 14/01/15 23:37:57 INFO storage.MemoryStore: MemoryStore started with
>>> > capacity 1218.8 MB.
>>> > 14/01/15 23:37:57 INFO network.ConnectionManager: Bound socket to port
>>> > 53683 with id = ConnectionManagerId(10.0.1.10,53683)
>>> > 14/01/15 23:37:57 INFO storage.BlockManagerMaster: Trying to register
>>> > BlockManager
>>> > 14/01/15 23:37:57 INFO storage.BlockManagerMasterActor$BlockManagerInfo:
>>> > Registering block manager 10.0.1.10:53683 with 1218.8 MB RAM
>>> > 14/01/15 23:37:57 INFO storage.BlockManagerMaster: Registered
>>> > BlockManager
>>> > 14/01/15 23:37:57 INFO spark.HttpServer: Starting HTTP Server
>>> > 14/01/15 23:37:57 INFO server.Server: jetty-7.6.8.v20121106
>>> > 14/01/15 23:37:57 INFO server.AbstractConnector: Started
>>> > SocketConnector@0.0.0.0:53684
>>> > 14/01/15 23:37:57 INFO broadcast.HttpBroadcast: Broadcast server started
>>> > at http://10.0.1.10:53684
>>> > 14/01/15 23:37:57 INFO spark.SparkEnv: Registering MapOutputTracker
>>> > 14/01/15 23:37:57 INFO spark.HttpFileServer: HTTP File server directory
>>> > is
>>> > /var/folders/mm/4qxz27w91p96v2zp5f9ncmqm38ychm/T/spark-e9304513-3714-430f-aa14-1a430a915d98
>>> > 14/01/15 23:37:57 INFO spark.HttpServer: Starting HTTP Server
>>> > 14/01/15 23:37:57 INFO server.Server: jetty-7.6.8.v20121106
>>> > 14/01/15 23:37:57 INFO server.AbstractConnector: Started
>>> > SocketConnector@0.0.0.0:53685
>>> > 14/01/15 23:37:57 INFO server.Server: jetty-7.6.8.v20121106
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/storage/rdd,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/storage,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/stages/stage,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/stages/pool,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/stages,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/environment,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/executors,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/metrics/json,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/static,null}
>>> > 14/01/15 23:37:57 INFO handler.ContextHandler: started
>>> > o.e.j.s.h.ContextHandler{/,null}
>>> > 14/01/15 23:37:57 INFO server.AbstractConnector: Started
>>> > SelectChannelConnector@0.0.0.0:4040
>>> > 14/01/15 23:37:57 INFO ui.SparkUI: Started Spark Web UI at
>>> > http://10.0.1.10:4040
>>> > 2014-01-15 23:37:57.929 java[34819:1020b] Unable to load realm mapping
>>> > info from SCDynamicStore
>>> > 14/01/15 23:37:58 INFO storage.MemoryStore: ensureFreeSpace(35456)
>>> > called with curMem=0, maxMem=1278030643
>>> > 14/01/15 23:37:58 INFO storage.MemoryStore: Block broadcast_0 stored as
>>> > values to memory (estimated size 34.6 KB, free 1218.8 MB)
>>> > 14/01/15 23:37:58 WARN util.NativeCodeLoader: Unable to load
>>> > native-hadoop library for your platform... using builtin-java classes 
>>> > where
>>> > applicable
>>> > 14/01/15 23:37:58 WARN snappy.LoadSnappy: Snappy native library not
>>> > loaded
>>> > 14/01/15 23:37:58 INFO mapred.FileInputFormat: Total input paths to
>>> > process : 1
>>> > 14/01/15 23:37:58 INFO spark.SparkContext: Starting job: foreach at
>>> > HdfsTest.scala:30
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Got job 0 (foreach at
>>> > HdfsTest.scala:30) with 1 output partitions (allowLocal=false)
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Final stage: Stage 0
>>> > (foreach at HdfsTest.scala:30)
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Parents of final stage:
>>> > List()
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Missing parents: List()
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Submitting Stage 0
>>> > (MappedRDD[2] at map at HdfsTest.scala:27), which has no missing parents
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Submitting 1 missing
>>> > tasks from Stage 0 (MappedRDD[2] at map at HdfsTest.scala:27)
>>> > 14/01/15 23:37:58 INFO scheduler.TaskSchedulerImpl: Adding task set 0.0
>>> > with 1 tasks
>>> > 14/01/15 23:37:58 INFO scheduler.TaskSetManager: Starting task 0.0:0 as
>>> > TID 0 on executor localhost: localhost (PROCESS_LOCAL)
>>> > 14/01/15 23:37:58 INFO scheduler.TaskSetManager: Serialized task 0.0:0
>>> > as 1778 bytes in 5 ms
>>> > 14/01/15 23:37:58 INFO executor.Executor: Running task ID 0
>>> > 14/01/15 23:37:58 INFO storage.BlockManager: Found block broadcast_0
>>> > locally
>>> > 14/01/15 23:37:58 INFO spark.CacheManager: Partition rdd_2_0 not found,
>>> > computing it
>>> > 14/01/15 23:37:58 INFO rdd.HadoopRDD: Input split:
>>> > file:/Users/acozzi/Documents/workspace/spark-samples/pom.xml:0+4092
>>> > 14/01/15 23:37:58 INFO storage.MemoryStore: ensureFreeSpace(2853) called
>>> > with curMem=35456, maxMem=1278030643
>>> > 14/01/15 23:37:58 INFO storage.MemoryStore: Block rdd_2_0 stored as
>>> > values to memory (estimated size 2.8 KB, free 1218.8 MB)
>>> > 14/01/15 23:37:58 INFO storage.BlockManagerMasterActor$BlockManagerInfo:
>>> > Added rdd_2_0 in memory on 10.0.1.10:53683 (size: 2.8 KB, free: 1218.8 MB)
>>> > 14/01/15 23:37:58 INFO storage.BlockManagerMaster: Updated info of block
>>> > rdd_2_0
>>> > 14/01/15 23:37:58 INFO executor.Executor: Serialized size of result for
>>> > 0 is 525
>>> > 14/01/15 23:37:58 INFO executor.Executor: Sending result for 0 directly
>>> > to driver
>>> > 14/01/15 23:37:58 INFO executor.Executor: Finished task ID 0
>>> > 14/01/15 23:37:58 INFO scheduler.TaskSetManager: Finished TID 0 in 61 ms
>>> > on localhost (progress: 0/1)
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Completed ResultTask(0,
>>> > 0)
>>> > 14/01/15 23:37:58 INFO scheduler.TaskSchedulerImpl: Remove TaskSet 0.0
>>> > from pool
>>> > 14/01/15 23:37:58 INFO scheduler.DAGScheduler: Stage 0 (foreach at
>>> > HdfsTest.scala:30) finished in 0.071 s
>>> > 14/01/15 23:37:58 INFO spark.SparkContext: Job finished: foreach at
>>> > HdfsTest.scala:30, took 0.151199 s
>>> > Iteration 1 took 189 ms
>>> > [WARNING]
>>> > java.lang.reflect.InvocationTargetException
>>> >     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>> >     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>> >     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>> >     at java.lang.reflect.Method.invoke(Method.java:597)
>>> >     at org.codehaus.mojo.exec.ExecJavaMojo$1.run(ExecJavaMojo.java:297)
>>> >     at java.lang.Thread.run(Thread.java:695)
>>> > Caused by: java.lang.IncompatibleClassChangeError: Implementing class
>>> >     at java.lang.ClassLoader.defineClass1(Native Method)
>>> >     at java.lang.ClassLoader.defineClassCond(ClassLoader.java:637)
>>> >     at java.lang.ClassLoader.defineClass(ClassLoader.java:621)
>>> >     at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:141)
>>> >     at java.net.URLClassLoader.defineClass(URLClassLoader.java:283)
>>> >     at java.net.URLClassLoader.access$000(URLClassLoader.java:58)
>>> >     at java.net.URLClassLoader$1.run(URLClassLoader.java:197)
>>> >     at java.security.AccessController.doPrivileged(Native Method)
>>> >     at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>>> >     at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>>> >     at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>>> >     at java.lang.Class.forName0(Native Method)
>>> >     at java.lang.Class.forName(Class.java:171)
>>> >     at org.apache.hadoop.mapred.SparkHadoopMapRedUtil$class.firstAvailableClass(SparkHadoopMapRedUtil.scala:48)
>>> >     at org.apache.hadoop.mapred.SparkHadoopMapRedUtil$class.newJobContext(SparkHadoopMapRedUtil.scala:23)
>>> >     at org.apache.hadoop.mapred.SparkHadoopWriter.newJobContext(SparkHadoopWriter.scala:40)
>>> >     at org.apache.hadoop.mapred.SparkHadoopWriter.getJobContext(SparkHadoopWriter.scala:149)
>>> >     at org.apache.hadoop.mapred.SparkHadoopWriter.preSetup(SparkHadoopWriter.scala:64)
>>> >     at org.apache.spark.rdd.PairRDDFunctions.saveAsHadoopDataset(PairRDDFunctions.scala:713)
>>> >     at org.apache.spark.rdd.PairRDDFunctions.saveAsHadoopFile(PairRDDFunctions.scala:686)
>>> >     at org.apache.spark.rdd.PairRDDFunctions.saveAsHadoopFile(PairRDDFunctions.scala:572)
>>> >     at org.apache.spark.rdd.RDD.saveAsTextFile(RDD.scala:894)
>>> >     at org.apache.spark.examples.HdfsTest$$anonfun$main$1.apply$mcVI$sp(HdfsTest.scala:34)
>>> >     at scala.collection.immutable.Range.foreach$mVc$sp(Range.scala:142)
>>> >     at org.apache.spark.examples.HdfsTest$.main(HdfsTest.scala:28)
>>> >     at org.apache.spark.examples.HdfsTest.main(HdfsTest.scala)
>>> >     ... 6 more
>>> > [INFO]
>>> > ------------------------------------------------------------------------
>>> > [INFO] BUILD FAILURE
>>> > [INFO]
>>> > ------------------------------------------------------------------------
>>> > [INFO] Total time: 3.224s
>>> > [INFO] Finished at: Wed Jan 15 23:37:58 PST 2014
>>> > [INFO] Final Memory: 12M/81M
>>> > [INFO]
>>> > ------------------------------------------------------------------------
>>> > [ERROR] Failed to execute goal
>>> > org.codehaus.mojo:exec-maven-plugin:1.2.1:java (default-cli) on project
>>> > spark-samples: An exception occured while executing the Java class. null:
>>> > InvocationTargetException: Implementing class -> [Help 1]
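>>> >
>>> > If I am reading the trace right, firstAvailableClass in
>>> > SparkHadoopMapRedUtil fails while loading one of the Hadoop JobContext
>>> > classes, which changed shape between Hadoop 1.x and 2.x, so this smells
>>> > like mixed Hadoop versions on my classpath. If a 1.x client is coming in
>>> > transitively through spark-core, an exclusion like this might be the fix
>>> > (untested sketch, artifact name assumed):
>>> >
>>> >   <dependency>
>>> >     <groupId>org.apache.spark</groupId>
>>> >     <artifactId>spark-core_${scala.tools.version}</artifactId>
>>> >     <version>0.9.0-incubating</version>
>>> >     <exclusions>
>>> >       <exclusion>
>>> >         <groupId>org.apache.hadoop</groupId>
>>> >         <artifactId>hadoop-client</artifactId>
>>> >       </exclusion>
>>> >     </exclusions>
>>> >   </dependency>
>>> >
>>> > (The duplicate SLF4J bindings warning at the top probably deserves a
>>> > similar cleanup: something is pulling in slf4j-simple alongside
>>> > slf4j-log4j12, and "mvn dependency:tree" should show which dependency.)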
>>> >
>>> >
>>> > Alex Cozzi
>>> > alexco...@gmail.com
>>> > On Jan 15, 2014, at 5:48 PM, Patrick Wendell <pwend...@gmail.com> wrote:
>>> >
>>> >> Please vote on releasing the following candidate as Apache Spark
>>> >> (incubating) version 0.9.0.
>>> >>
>>> >> A draft of the release notes along with the changes file is attached
>>> >> to this e-mail.
>>> >>
>>> >> The tag to be voted on is v0.9.0-incubating (commit 7348893):
>>> >>
>>> >> https://git-wip-us.apache.org/repos/asf?p=incubator-spark.git;a=commit;h=7348893f0edd96dacce2f00970db1976266f7008
>>> >>
>>> >> The release files, including signatures, digests, etc can be found at:
>>> >> http://people.apache.org/~pwendell/spark-0.9.0-incubating-rc1/
>>> >>
>>> >> Release artifacts are signed with the following key:
>>> >> https://people.apache.org/keys/committer/pwendell.asc
>>> >>
>>> >> The staging repository for this release can be found at:
>>> >> https://repository.apache.org/content/repositories/orgapachespark-1001/
>>> >>
>>> >> The documentation corresponding to this release can be found at:
>>> >> http://people.apache.org/~pwendell/spark-0.9.0-incubating-rc1-docs/
>>> >>
>>> >> Please vote on releasing this package as Apache Spark 0.9.0-incubating!
>>> >>
>>> >> The vote is open until Sunday, January 19, at 02:00 UTC
>>> >> and passes if a majority of at least 3 +1 PPMC votes are cast.
>>> >>
>>> >> [ ] +1 Release this package as Apache Spark 0.9.0-incubating
>>> >> [ ] -1 Do not release this package because ...
>>> >>
>>> >> To learn more about Apache Spark, please see
>>> >> http://spark.incubator.apache.org/
>>> >
>>>
>>