np ;-)
On Wed, Apr 2, 2014 at 5:50 PM, Leon Zhang <leonca...@gmail.com> wrote:

> Aha, thank you for your kind reply.
>
> Upgrading to 0.9.1 is a good choice. :)
>
>
> On Wed, Apr 2, 2014 at 11:35 PM, andy petrella <andy.petre...@gmail.com> wrote:
>
>> Heya,
>>
>> Yep this is a problem in the Mesos scheduler implementation that has been
>> fixed after 0.9.0 (https://spark-project.atlassian.net/browse/SPARK-1052 =>
>> MesosSchedulerBackend)
>>
>> So several options, like applying the patch, upgrading to 0.9.1 :-/
>>
>> Cheers,
>> Andy
>>
>>
>> On Wed, Apr 2, 2014 at 5:30 PM, Leon Zhang <leonca...@gmail.com> wrote:
>>
>>> Hi, Spark Devs:
>>>
>>> I encounter a problem which shows error message
>>> "akka.actor.ActorNotFound" on our mesos mini-cluster.
>>>
>>> mesos : 0.17.0
>>> spark : spark-0.9.0-incubating
>>>
>>> spark-env.sh:
>>> #!/usr/bin/env bash
>>>
>>> export MESOS_NATIVE_LIBRARY=/usr/local/lib/libmesos.so
>>> export SPARK_EXECUTOR_URI=hdfs://192.168.1.20/tmp/spark-0.9.0-incubating-hadoop_2.0.0-cdh4.6.0-bin.tar.gz
>>> export MASTER=zk://192.168.1.20:2181/mesos
>>> export SPARK_JAVA_OPTS="-Dspark.driver.port=17077"
>>>
>>> And the logs from each slave looks like:
>>>
>>> 14/04/02 15:14:37 INFO MesosExecutorBackend: Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
>>> 14/04/02 15:14:37 INFO MesosExecutorBackend: Registered with Mesos as executor ID 201403301937-335653056-5050-991-1
>>> 14/04/02 15:14:38 INFO Slf4jLogger: Slf4jLogger started
>>> 14/04/02 15:14:38 INFO Remoting: Starting remoting
>>> 14/04/02 15:14:38 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://spark@zetyun-cloud3:42218]
>>> 14/04/02 15:14:38 INFO Remoting: Remoting now listens on addresses: [akka.tcp://spark@zetyun-cloud3:42218]
>>> 14/04/02 15:14:38 INFO SparkEnv: Connecting to BlockManagerMaster: akka.tcp://spark@localhost:17077/user/BlockManagerMaster
>>> akka.actor.ActorNotFound: Actor not found for: ActorSelection[Actor[akka.tcp://spark@localhost:17077/]/user/BlockManagerMaster]
>>>     at akka.actor.ActorSelection$anonfun$resolveOne$1.apply(ActorSelection.scala:66)
>>>     at akka.actor.ActorSelection$anonfun$resolveOne$1.apply(ActorSelection.scala:64)
>>>     at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:32)
>>>     at akka.dispatch.BatchingExecutor$Batch$anonfun$run$1.processBatch$1(BatchingExecutor.scala:67)
>>>     at akka.dispatch.BatchingExecutor$Batch$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:82)
>>>     at akka.dispatch.BatchingExecutor$Batch$anonfun$run$1.apply(BatchingExecutor.scala:59)
>>>     at akka.dispatch.BatchingExecutor$Batch$anonfun$run$1.apply(BatchingExecutor.scala:59)
>>>     at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
>>>     at akka.dispatch.BatchingExecutor$Batch.run(BatchingExecutor.scala:58)
>>>     at akka.dispatch.ExecutionContexts$sameThreadExecutionContext$.unbatchedExecute(Future.scala:74)
>>>     at akka.dispatch.BatchingExecutor$class.execute(BatchingExecutor.scala:110)
>>>     at akka.dispatch.ExecutionContexts$sameThreadExecutionContext$.execute(Future.scala:73)
>>>     at scala.concurrent.impl.CallbackRunnable.executeWithValue(Promise.scala:40)
>>>     at scala.concurrent.impl.Promise$DefaultPromise.tryComplete(Promise.scala:248)
>>>     at akka.pattern.PromiseActorRef.$bang(AskSupport.scala:269)
>>>     at akka.actor.EmptyLocalActorRef.specialHandle(ActorRef.scala:512)
>>>     at akka.actor.DeadLetterActorRef.specialHandle(ActorRef.scala:545)
>>>     at akka.actor.DeadLetterActorRef.$bang(ActorRef.scala:535)
>>>     at akka.remote.RemoteActorRefProvider$RemoteDeadLetterActorRef.$bang(RemoteActorRefProvider.scala:91)
>>>     at akka.actor.ActorRef.tell(ActorRef.scala:125)
>>>     at akka.dispatch.Mailboxes$anon$1$anon$2.enqueue(Mailboxes.scala:44)
>>>     at akka.dispatch.QueueBasedMessageQueue$class.cleanUp(Mailbox.scala:438)
>>>     at akka.dispatch.UnboundedDequeBasedMailbox$MessageQueue.cleanUp(Mailbox.scala:650)
>>>     at akka.dispatch.Mailbox.cleanUp(Mailbox.scala:309)
>>>     at akka.dispatch.MessageDispatcher.unregister(AbstractDispatcher.scala:204)
>>>     at akka.dispatch.MessageDispatcher.detach(AbstractDispatcher.scala:140)
>>>     at akka.actor.dungeon.FaultHandling$class.akka$actor$dungeon$FaultHandling$finishTerminate(FaultHandling.scala:203)
>>>     at akka.actor.dungeon.FaultHandling$class.terminate(FaultHandling.scala:163)
>>>     at akka.actor.ActorCell.terminate(ActorCell.scala:338)
>>>     at akka.actor.ActorCell.invokeAll$1(ActorCell.scala:431)
>>>     at akka.actor.ActorCell.systemInvoke(ActorCell.scala:447)
>>>     at akka.dispatch.Mailbox.processAllSystemMessages(Mailbox.scala:262)
>>>     at akka.dispatch.Mailbox.run(Mailbox.scala:218)
>>>     at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:386)
>>>     at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>>>     at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
>>>     at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>>>     at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
>>> Exception in thread "Thread-0"
>>>
>>> Any clue for this problem?
>>>
>>> Thanks in advance.
>>>
>>
>>
>