[ https://issues.apache.org/jira/browse/HELIX-681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16451423#comment-16451423 ]
ASF GitHub Bot commented on HELIX-681: -------------------------------------- GitHub user zhan849 opened a pull request: https://github.com/apache/helix/pull/197 [HELIX-681] change controller msg purge timeout to larger number Changed message purge delay to 1min, updated tests accordingly. You can merge this pull request into a Git repository by running: $ git pull https://github.com/zhan849/helix harry/ctl-msg-cleanup Alternatively you can review and apply these changes as the patch at: https://github.com/apache/helix/pull/197.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #197 ---- commit 4e02cbb9945279b7085e5c725b9d966b90086cc7 Author: Harry Zhang <zhan849@...> Date: 2018-04-24T23:46:14Z [HELIX-681] change controller msg purge timeout to larger number ---- > Participant should not fail state transition on fail to delete / relay message > ------------------------------------------------------------------------------ > > Key: HELIX-681 > URL: https://issues.apache.org/jira/browse/HELIX-681 > Project: Apache Helix > Issue Type: Bug > Reporter: Hao Zhang > Priority: Major > > Currently we have a general try-catch block in HelixTask and > HelixTaskExecutor, which, upon any exception thrown from state transition > routine, will fail state transition. However there are at least the following > cases in which state transition should be considered as successful: > * When we fail to delete message after successfully handled message and > updated current state -> this is because we already completed state > transition and current state is consistent between participant and ZK > * When we fail to send out relay message > as relay message provides only > best effort of delivering messages, which has nothing to do with state > transition's results. In case of fail to relay message, controller will > resend message which ensures correctness. -- This message was sent by Atlassian JIRA (v7.6.3#76005)