[ 
https://issues.apache.org/jira/browse/GIRAPH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13887351#comment-13887351
 ] 

Eli Reisman commented on GIRAPH-747:
------------------------------------

I had a chance to look again, and my read is that this patch breaks non-YARN. 
We might need to adjust the patch to use another method. I do think this is a 
real issue, and we should get something in to fix it.

> BspServiceMaster finishes ZooKeeper cleanup without waiting for all workers 
> to complete
> ---------------------------------------------------------------------------------------
>
>                 Key: GIRAPH-747
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-747
>             Project: Giraph
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>            Reporter: Chuan Lei
>            Assignee: Chuan Lei
>             Fix For: 1.0.0
>
>         Attachments: GIRAPH-747.v1.patch
>
>
> In BspServiceMaster, the function cleanUpZooKeeper should wait for all 
> workers and masters to complete. However, it appears that maxTasks only 
> takes workers into consideration. Consequently, a straggling worker may fail 
> to report to ZooKeeper because the path is removed too early. This causes a 
> "No lease on path File does not exist" exception at runtime.
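
The counting problem described above can be illustrated with a minimal, self-contained sketch. It is not Giraph code: the class CleanupBarrierSketch, its methods, and the node names are hypothetical, and it assumes that masters and workers record their "done" markers in the same place, so that a master's marker can satisfy a worker-only count while a worker is still running.

```java
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; none of these names come from Giraph.
final class CleanupBarrierSketch {

    /** Number of tasks that must report before cleanup: workers plus masters. */
    static int expectedTasks(int workers, int masters) {
        return workers + masters;
    }

    /** True only once at least `expected` tasks have written a "done" marker. */
    static boolean safeToRemove(Set<String> reported, int expected) {
        return reported.size() >= expected;
    }

    public static void main(String[] args) {
        int workers = 3;
        int masters = 1;

        // Two workers and the master have reported; worker-2 is a straggler.
        Set<String> reported = new TreeSet<>();
        reported.add("worker-0");
        reported.add("worker-1");
        reported.add("master-0");

        // Counting workers only: three markers look like "everyone is done".
        System.out.println("worker-only count satisfied: "
            + safeToRemove(reported, workers));

        // Counting workers and masters: cleanup still waits for worker-2.
        System.out.println("full count satisfied: "
            + safeToRemove(reported, expectedTasks(workers, masters)));
    }
}
```

With the worker-only threshold, the path would be removed while worker-2 is still running; with the combined threshold, cleanup waits. Whether this matches the actual layout of the cleanup path in BspServiceMaster is an assumption that should be checked against the source.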



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
