[ 
https://issues.apache.org/jira/browse/SPARK-13580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172810#comment-15172810
 ] 

Liyin Tang commented on SPARK-13580:
------------------------------------

Thanks [~zsxwing] for the investigation! That's very helpful!

> Driver makes no progress after failed to remove broadcast on Executor
> ---------------------------------------------------------------------
>
>                 Key: SPARK-13580
>                 URL: https://issues.apache.org/jira/browse/SPARK-13580
>             Project: Spark
>          Issue Type: Bug
>          Components: Streaming
>    Affects Versions: 1.5.2
>            Reporter: Liyin Tang
>         Attachments: driver_jstack.txt, driver_log.txt, executor_jstack, 
> stderrfiltered.txt.gz
>
>
> According to the driver's log, the driver failed to remove broadcast data
> because of an RPC timeout exception from executor #11. It also failed to get
> a thread dump from executor #11 because of an akka.actor.ActorNotFound exception.
> After that, the driver waited for executor #11 to finish one task for that job;
> all the other tasks for that job had finished.
> However, according to executor #11's log, it never received that task (it
> received 9 other tasks and finished them).
> Since then, the streaming job has made no progress.
> I have attached the driver's log and jstack, and the executor's jstack.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

