[ 
https://issues.apache.org/jira/browse/OOZIE-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13634639#comment-13634639
 ] 

Alejandro Abdelnur commented on OOZIE-1332:
-------------------------------------------

+1
                
> Flakey test TestActionCheckXCommand.testActionCheckTransientDuringMRAction
> --------------------------------------------------------------------------
>
>                 Key: OOZIE-1332
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1332
>             Project: Oozie
>          Issue Type: Bug
>          Components: tests
>    Affects Versions: trunk, 3.3.2, 4.0.0
>            Reporter: Robert Kanter
>            Assignee: Robert Kanter
>             Fix For: trunk, 4.0.0
>
>         Attachments: OOZIE-1332.patch
>
>
> We've been seeing 
> TestActionCheckXCommand.testActionCheckTransientDuringMRAction as flakey on 
> our CI tests.  
> This incorrect sequence of events was occurring:
> {noformat}
> ...
> - ActionCheckXCommand start
>   - suspend WF
> - ActionCheckXCommand end
> - ActionCheckXCommand start
>   - JT up
>   - unknown hadoop job error
>     - fail WF
> - ActionCheckXCommand end
> - ResumeXCommand cannot start (because WF fail)
> - test finish
> {noformat}
> when this correct sequence of events should have been occuring:
> {noformat}
> ...
> - ActionCheckXCommand start
>   - suspend WF
> - ActionCheckXCommand end
> - JT Up
> - ResumeXCommand start
> - ResumeXCommand end
> - ActionStartXCommand start
> - ActionStartXCommadn end
> - ActionCheckXCommand start
> - ActionCheckXCommand end
> ...
> - test finish
> {noformat}
> It turns out that the ActionCheckerService was triggering an extra 
> ActionCheckXCommand at just the wrong moment.  We should disable the 
> ActionCheckerService during this test to prevent this issue (When I 
> originally wrote the test, I incorrectly thought it was needed and that it 
> had to run more frequently to make the test faster).
> As a pre-emptive measure, we should also do this for 
> TestActionCheckXCommand.testActionCheckTransientDuringLauncher, which is very 
> similar even though we didn't see any flakiness.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to