[ https://issues.apache.org/jira/browse/SPARK-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matthew Farrellee closed SPARK-538.
-----------------------------------
    Resolution: Done

> INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone
> -----------------------------------------------------------------------------
>
>          Key: SPARK-538
>          URL: https://issues.apache.org/jira/browse/SPARK-538
>      Project: Spark
>   Issue Type: Bug
>     Reporter: vince67
>
> Hi Matei,
> Maybe I did not describe it clearly.
> Running the masters and slaves on different machines works.
> But when we run spark.examples.SparkPi on the master, our process hangs and we never get a result.
> The output looks like this:
>
> 12/09/02 16:47:54 INFO spark.BoundedMemoryCache: BoundedMemoryCache.maxBytes = 339585269
> 12/09/02 16:47:54 INFO spark.CacheTrackerActor: Registered actor on port 7077
> 12/09/02 16:47:54 INFO spark.CacheTrackerActor: Started slave cache (size 323.9MB) on vince67-ThinkCentre-XXXX
> 12/09/02 16:47:54 INFO spark.MapOutputTrackerActor: Registered actor on port 7077
> 12/09/02 16:47:54 INFO spark.ShuffleManager: Shuffle dir: /tmp/spark-local-3e79b235-1b94-44d1-823b-0369f6698688/shuffle
> 12/09/02 16:47:54 INFO server.Server: jetty-7.5.3.v20111011
> 12/09/02 16:47:54 INFO server.AbstractConnector: Started SelectChannelConnector@0.0.0.0:49578 STARTING
> 12/09/02 16:47:54 INFO spark.ShuffleManager: Local URI: http://ip.ip.ip.ip:49578
> 12/09/02 16:47:55 INFO server.Server: jetty-7.5.3.v20111011
> 12/09/02 16:47:55 INFO server.AbstractConnector: Started SelectChannelConnector@0.0.0.0:49600 STARTING
> 12/09/02 16:47:55 INFO broadcast.HttpBroadcast: Broadcast server started at http://ip.ip.ip.ip:49600
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Registered as framework ID 201209021640-74572372-5050-16898-0004
> 12/09/02 16:47:55 INFO spark.SparkContext: Starting job...
> 12/09/02 16:47:55 INFO spark.CacheTracker: Registering RDD ID 1 with cache
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Registering RDD 1 with 2 partitions
> 12/09/02 16:47:55 INFO spark.CacheTracker: Registering RDD ID 0 with cache
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Registering RDD 0 with 2 partitions
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Asked for current cache locations
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Final stage: Stage 0
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Parents of final stage: List()
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Missing parents: List()
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Submitting Stage 0, which has no missing parents
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Got a job with 2 tasks
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Adding job with ID 0
> 12/09/02 16:47:55 INFO spark.SimpleJob: Starting task 0:0 as TID 0 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:55 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 151 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:55 INFO spark.SimpleJob: Starting task 0:1 as TID 1 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:55 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 0 (task 0:0)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Starting task 0:0 as TID 2 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 1 (task 0:1)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Starting task 0:1 as TID 3 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 5 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:57 INFO spark.SimpleJob: Lost TID 2 (task 0:0)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Starting task 0:0 as TID 4 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:57 INFO spark.SimpleJob: Lost TID 3 (task 0:1)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Starting task 0:1 as TID 5 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 2 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:58 INFO spark.SimpleJob: Lost TID 4 (task 0:0)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Starting task 0:0 as TID 6 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:58 INFO spark.SimpleJob: Lost TID 5 (task 0:1)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Starting task 0:1 as TID 7 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:59 INFO spark.SimpleJob: Lost TID 6 (task 0:0)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Starting task 0:0 as TID 8 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:59 INFO spark.SimpleJob: Lost TID 7 (task 0:1)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Starting task 0:1 as TID 9 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:48:00 INFO spark.SimpleJob: Lost TID 8 (task 0:0)
> 12/09/02 16:48:00 ERROR spark.SimpleJob: Task 0:0 failed more than 4 times; aborting job
> 12/09/02 16:48:00 INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone
>
> Your help will be appreciated.
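
In the log above, task 0:0 is reported lost five times (TIDs 0, 2, 4, 6, 8) and task 0:1 four times (TIDs 1, 3, 5, 7) before the job is aborted. When triaging a longer driver log with the same pattern, a small script can tally the losses per task. The following is only a minimal sketch: it assumes the "Lost TID N (task S:P)" line format seen in this report, and the names `LOST_RE` and `count_lost_tasks` are hypothetical helpers, not part of Spark or Mesos.

```python
import re
from collections import Counter

# Matches lines such as:
#   12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 0 (task 0:0)
# The format is assumed from the log in this report.
LOST_RE = re.compile(r"Lost TID (\d+) \(task (\d+:\d+)\)")


def count_lost_tasks(log_lines):
    """Return a Counter mapping a task id (e.g. '0:0') to its number of 'Lost TID' lines."""
    losses = Counter()
    for line in log_lines:
        match = LOST_RE.search(line)
        if match:
            losses[match.group(2)] += 1
    return losses


# A few lines copied from the report, used as sample input.
sample = [
    "12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 0 (task 0:0)",
    "12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 1 (task 0:1)",
    "12/09/02 16:47:57 INFO spark.SimpleJob: Lost TID 2 (task 0:0)",
    "12/09/02 16:47:57 INFO spark.SimpleJob: Starting task 0:0 as TID 4 on slave",
]
losses = count_lost_tasks(sample)
for task, count in sorted(losses.items()):
    print(f"task {task}: lost {count} time(s)")
```

Every attempt here is lost about one second after launch on the same slave, so the slave-side executor output for that host is probably the next place to look; this script only summarizes the driver log and does not explain why the tasks were lost.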