[ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Davies Liu updated SPARK-5098:
------------------------------
    Description:
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
[Stage 0:===========================================================(44 + -14) / 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
[Stage 0:==========================================================(45 + -29) / 40]

  was:
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
[Stage 0:=================================================================================================================(44 + -14) / 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
[Stage 0:=================================================================================================================(45 + -29) / 40]

> Number of running tasks become negative after tasks lost
> --------------------------------------------------------
>
>                 Key: SPARK-5098
>                 URL: https://issues.apache.org/jira/browse/SPARK-5098
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.2.0
>            Reporter: Davies Liu
>            Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> [Stage 0:===========================================================(44 + -14) / 40]
> 15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> [Stage 0:==========================================================(45 + -29) / 40]

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)