[ https://issues.apache.org/jira/browse/GIRAPH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14005435#comment-14005435 ]
Roman Shaposhnik commented on GIRAPH-747: ----------------------------------------- [~initialcontext] Any chance you can pick this up for 1.1.0? You're our last hope ;-) > BspServiceMaster finishes ZooKeeper cleanup without waiting for all workers > to complete > --------------------------------------------------------------------------------------- > > Key: GIRAPH-747 > URL: https://issues.apache.org/jira/browse/GIRAPH-747 > Project: Giraph > Issue Type: Bug > Affects Versions: 1.0.0 > Reporter: Chuan Lei > Assignee: Chuan Lei > Fix For: 1.1.0 > > Attachments: GIRAPH-747.v1.patch > > > In BspServiceMaster, the function cleanUpZooKeeper should wait for the number > of workers and masters to complete. However, it appears that maxTasks only > takes workers into consideration. Consequently, the worker straggler may fail > to report to the ZooKeeper due to the path gets removed too early. This will > cause No lease on path File does not exist exception at runtime. -- This message was sent by Atlassian JIRA (v6.2#6252)