GitHub user dlogothetis opened a pull request:
https://github.com/apache/giraph/pull/84
Fix BspServiceMaster bug
Occasionally, BspServiceMaster fails to detect that a worker finished a
superstep (e.g. input superstep, or a compute supestep). As a result, the job
may get stuck as the master keeps waiting for all workers to finish.
While this is hard to reproduce, this fixes a potential cause for this.
Tests
- Unit tests
- Internal snapshot tests
- Ran large job
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/dlogothetis/giraph fix_bsp
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/giraph/pull/84.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #84
----
commit 507956f1a053510be2e4d41cdf0d3444bddd4bbb
Author: Dionysios Logothetis <dlogothetis@...>
Date: 2018-09-18T16:30:20Z
Fix BspServiceMaster bug
----
---