Alex Angelini created SPARK-9745: ------------------------------------ Summary: Applications hangs when the last executor fails with dynamic allocation Key: SPARK-9745 URL: https://issues.apache.org/jira/browse/SPARK-9745 Project: Spark Issue Type: Bug Components: PySpark, Scheduler, YARN Affects Versions: 1.5.0 Environment: YARN + Pyspark + Dynamic Allocation Reporter: Alex Angelini
When a job has only a single executor remaining and that executor dies (due to something like an OOM), the application fails to notice that there are no executors left and it hangs indefinitely. This only happens when dynamic allocation is enabled. The following images were taken from hung application with no executors: -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org