Summary: osaf: add function to return takeover request expiry time [#3029]
Review request for Ticket(s): 3029
Peer Reviewer(s): Minh, Hans 
Pull request to: *** LIST THE PERSON WITH PUSH ACCESS HERE ***
Affected branch(es): develop
Development branch: ticket-3029
Base revision: 4f86e371d28a385f689011a0effef8aaae65e713
Personal repository: git://git.code.sf.net/u/userid-2226215/review

--------------------------------
Impacted area       Impact y/n
--------------------------------
 Docs                    n
 Build system            n
 RPM/packaging           n
 Configuration files     n
 Startup scripts         n
 SAF services            y 
 OpenSAF services        y 
 Core libraries          n
 Samples                 n
 Tests                   n
 Other                   n


Comments (indicate scope for each "y" above):
---------------------------------------------

revision 1f48477cdcd92356cd446ad81741f9373724be7c
Author: Gary Lee <gary....@dektech.com.au>
Date:   Wed, 3 Jul 2019 16:19:17 +1000

amfd: improve controller failover behavior [#3029]

If consensus service is enabled, only perform node failover
after peer controller has self-fenced
(after 2 * FMS_TAKEOVER_REQUEST_VALID_TIME seconds).

This also means if node failover delay is set to a large value,
we do not unnecesarily wait too long before failing over assignments
previously assigned to the peer controller.

Remove unused fmd_conf_file variable.

Change some LOG_ER calls to LOG_WA.



revision 5e03fc3e30920989080f6617ca404f7f60f4a8cc
Author: Gary Lee <gary....@dektech.com.au>
Date:   Wed, 3 Jul 2019 16:19:10 +1000

fmd: add active promotion supervision timer [#3029]

Add supervision timer so controller will reboot if it cannot obtain
consensus lock within the allocation period
(2* FMS_TAKEOVER_REQUEST_VALID_TIME).

The peer controller can then safely perform a node failover
after this period of time.



revision c2a9e9d8712952526660efe678daee39f85d1d68
Author: Gary Lee <gary....@dektech.com.au>
Date:   Wed, 3 Jul 2019 15:34:36 +1000

osaf: add function to return takeover request expiry time [#3029]



Complete diffstat:
------------------
 src/amf/amfd/cb.h                  |  1 -
 src/amf/amfd/clm.cc                |  4 +-
 src/amf/amfd/main.cc               |  1 -
 src/amf/amfd/ndfsm.cc              |  8 ++--
 src/amf/amfd/ndproc.cc             | 19 ++++++++++
 src/amf/amfd/node_state.cc         | 23 +++++------
 src/amf/amfd/node_state_machine.cc | 19 ++++++++++
 src/amf/amfd/node_state_machine.h  |  2 +
 src/amf/amfd/proc.h                |  1 +
 src/fm/fmd/fm_cb.h                 |  2 +
 src/fm/fmd/fm_main.cc              | 14 ++++++-
 src/fm/fmd/fm_rda.cc               | 78 ++++++++++++++++++++++++++------------
 src/osaf/consensus/consensus.cc    |  4 ++
 src/osaf/consensus/consensus.h     |  2 +
 14 files changed, 134 insertions(+), 44 deletions(-)


Testing Commands:
-----------------
1) Ensure a 2N application is active on standby controller,
   and standy on the active controller
2) Isolate active & standby controller

Testing, Expected Results:
--------------------------
amfd should failover 2N application only after
2 * FMS_TAKEOVER_REQUEST_VALID_TIME seconds

Conditions of Submission:
-------------------------
Ack from reviewer

Arch      Built     Started    Linux distro
-------------------------------------------
mips        n          n
mips64      n          n
x86         n          n
x86_64      y          y 
powerpc     n          n
powerpc64   n          n


Reviewer Checklist:
-------------------
[Submitters: make sure that your review doesn't trigger any checkmarks!]


Your checkin has not passed review because (see checked entries):

___ Your RR template is generally incomplete; it has too many blank entries
    that need proper data filled in.

___ You have failed to nominate the proper persons for review and push.

___ Your patches do not have proper short+long header

___ You have grammar/spelling in your header that is unacceptable.

___ You have exceeded a sensible line length in your headers/comments/text.

___ You have failed to put in a proper Trac Ticket # into your commits.

___ You have incorrectly put/left internal data in your comments/files
    (i.e. internal bug tracking tool IDs, product names etc)

___ You have not given any evidence of testing beyond basic build tests.
    Demonstrate some level of runtime or other sanity testing.

___ You have ^M present in some of your files. These have to be removed.

___ You have needlessly changed whitespace or added whitespace crimes
    like trailing spaces, or spaces before tabs.

___ You have mixed real technical changes with whitespace and other
    cosmetic code cleanup changes. These have to be separate commits.

___ You need to refactor your submission into logical chunks; there is
    too much content into a single commit.

___ You have extraneous garbage in your review (merge commits etc)

___ You have giant attachments which should never have been sent;
    Instead you should place your content in a public tree to be pulled.

___ You have too many commits attached to an e-mail; resend as threaded
    commits, or place in a public tree for a pull.

___ You have resent this content multiple times without a clear indication
    of what has changed between each re-send.

___ You have failed to adequately and individually address all of the
    comments and change requests that were proposed in the initial review.

___ You have a misconfigured ~/.gitconfig file (i.e. user.name, user.email etc)

___ Your computer have a badly configured date and time; confusing the
    the threaded patch review.

___ Your changes affect IPC mechanism, and you don't present any results
    for in-service upgradability test.

___ Your changes affect user manual and documentation, your patch series
    do not contain the patch that updates the Doxygen manual.



_______________________________________________
Opensaf-devel mailing list
Opensaf-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-devel

Reply via email to