[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

2015-05-04 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-5098:
-
Target Version/s:   (was: 1.2.1, 1.3.0)

> Number of running tasks become negative after tasks lost
> 
>
> Key: SPARK-5098
> URL: https://issues.apache.org/jira/browse/SPARK-5098
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Core
>Affects Versions: 1.2.0
>Reporter: Davies Liu
>Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on 
> spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote 
> system 
> [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] 
> has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, 
> spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
> lost)
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, 
> spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
> lost)
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
> non-existent executor 6
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
> non-existent executor 6
> [Stage 0:===(44 + 
> -14) / 40]
> 15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on 
> spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote 
> system 
> [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] 
> has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, 
> spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
> lost)
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, 
> spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
> lost)
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
> non-existent executor 2
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
> non-existent executor 2
> [Stage 0:==(45 + -29) 
> / 40]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

2015-01-05 Thread Davies Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Davies Liu updated SPARK-5098:
--
Description: 
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on 
spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote 
system 
[akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has 
failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, 
spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, 
spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 6
[Stage 0:===(44 + -14) 
/ 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on 
spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote 
system 
[akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has 
failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, 
spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, 
spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 2
[Stage 0:==(45 + -29) / 
40]

  was:
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on 
spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote 
system 
[akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has 
failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, 
spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, 
spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 
lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 6
[Stage 
0:=(44
 + -14) / 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on 
spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote 
system 
[akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has 
failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, 
spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, 
spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 
lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove 
non-existent executor 2
[Stage 
0:=(45
 + -29) / 40]


> Number of running tasks become negative after tasks lost
> 
>
> Key: SPARK-5098
> URL: https://issues.apache.org/jira/browse/SPARK-5098
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Core
>Affects Versions: 1.2.0
>Reporter: Davies Liu
>Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on 
> spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote 
> system 
> [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] 
> has failed, address is now gated for [5000] ms. Reason is: [Disassociated]