surahman opened a new issue #3733:
URL: https://github.com/apache/incubator-heron/issues/3733
**Describe the bug**
The `stmgr_unittest` is consistently causing failures with timeouts during
the CI or when running tests with fewer execution threads available.
**To Reproduce**
Steps to reproduce the behaviour:
1. Trigger the TravisCI pipeline.
or
1. Run in an environment with only 4 or fewer cores available (no H/T).
2. Build in a Ubuntu Docker container using the provided build script.
This will not occur if you have more cores and threads at your disposal to
run the tests.
**Expected behavior**
Ideally: a passing `stmgr_unittest`.
**Isolated Issue**
The following three tests in the test suite are causing the problems.
Disabling them alleviates the problem. It is my belief that this is the result
of a deadlock or livelock situation but I am investigating further to confirm
the exact source of the problem.
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L960-L1068
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L1071-L1205
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L1328-L1441
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]