[ 
https://issues.apache.org/jira/browse/HELIX-681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16451423#comment-16451423
 ] 

ASF GitHub Bot commented on HELIX-681:
--------------------------------------

GitHub user zhan849 opened a pull request:

    https://github.com/apache/helix/pull/197

    [HELIX-681] change controller msg purge timeout to larger number

    Changed message purge delay to 1min, updated tests accordingly.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zhan849/helix harry/ctl-msg-cleanup

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/helix/pull/197.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #197
    
----
commit 4e02cbb9945279b7085e5c725b9d966b90086cc7
Author: Harry Zhang <zhan849@...>
Date:   2018-04-24T23:46:14Z

    [HELIX-681] change controller msg purge timeout to larger number

----


> Participant should not fail state transition on fail to delete / relay message
> ------------------------------------------------------------------------------
>
>                 Key: HELIX-681
>                 URL: https://issues.apache.org/jira/browse/HELIX-681
>             Project: Apache Helix
>          Issue Type: Bug
>            Reporter: Hao Zhang
>            Priority: Major
>
> Currently we have a general try-catch block in HelixTask and 
> HelixTaskExecutor, which, upon any exception thrown from state transition 
> routine, will fail state transition. However there are at least the following 
> cases in which state transition should be considered as successful:
>  * When we fail to delete message after successfully handled message and 
> updated current state -> this is because we already completed state 
> transition and current state is consistent between participant and ZK
>  * When we fail to send out relay message > as relay message provides only 
> best effort of delivering messages, which has nothing to do with state 
> transition's results. In case of fail to relay message, controller will 
> resend message which ensures correctness.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to