[ 
https://issues.apache.org/jira/browse/ARTEMIS-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francesco Nigro updated ARTEMIS-3345:
-------------------------------------
    Description: 
A failing-back master forget its Node ID and on broker restart, having a 
different Node ID it can become live, without checking for any existing live 
with its old Node ID.

This is happen because of this mechanics from SharedNothingBackupActivation:

# {{SharedNothingBackupActivation::init}} is going to call 
activeMQServer.resetNodeManager that's re-creating a NodeManager with 
replicatingBackup == true
# SharedNothingBackupActivation::run is then
{code:java}
         // move all data away:
         activeMQServer.getNodeManager().stop();
         
activeMQServer.moveServerData(replicaPolicy.getMaxSavedReplicatedJournalsSize());
         activeMQServer.getNodeManager().start();
{code}




  was:
A failing-back master forget its Node ID if any error happen on failover and 
restart as an empty backup. 
On broker restart, it got a different NodeID and become live.


> Shared-Nothing Replication Master loose its Node ID on failed fail-back
> -----------------------------------------------------------------------
>
>                 Key: ARTEMIS-3345
>                 URL: https://issues.apache.org/jira/browse/ARTEMIS-3345
>             Project: ActiveMQ Artemis
>          Issue Type: Bug
>            Reporter: Francesco Nigro
>            Assignee: Francesco Nigro
>            Priority: Major
>
> A failing-back master forget its Node ID and on broker restart, having a 
> different Node ID it can become live, without checking for any existing live 
> with its old Node ID.
> This is happen because of this mechanics from SharedNothingBackupActivation:
> # {{SharedNothingBackupActivation::init}} is going to call 
> activeMQServer.resetNodeManager that's re-creating a NodeManager with 
> replicatingBackup == true
> # SharedNothingBackupActivation::run is then
> {code:java}
>          // move all data away:
>          activeMQServer.getNodeManager().stop();
>          
> activeMQServer.moveServerData(replicaPolicy.getMaxSavedReplicatedJournalsSize());
>          activeMQServer.getNodeManager().start();
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to