[jira] [Work logged] (ARTEMIS-2892) Replication stopped working after many failovers

2024-06-13 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARTEMIS-2892?focusedWorklogId=923387&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-923387
 ]

ASF GitHub Bot logged work on ARTEMIS-2892:
---

Author: ASF GitHub Bot
Created on: 13/Jun/24 21:48
Start Date: 13/Jun/24 21:48
Worklog Time Spent: 10m 
  Work Description: clebertsuconic merged PR #4978:
URL: https://github.com/apache/activemq-artemis/pull/4978




Issue Time Tracking
---

Worklog Id: (was: 923387)
Time Spent: 20m  (was: 10m)

> Replication stopped working after many failovers
> 
>
> Key: ARTEMIS-2892
> URL: https://issues.apache.org/jira/browse/ARTEMIS-2892
> Project: ActiveMQ Artemis
> Issue Type: Bug
> Components: Broker
> Affects Versions: 2.11.0
> Reporter: Chris Oman
> Priority: Major
> Labels: replication
> Attachments: artemis-1017.log, artemis-1019.log
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> We have been testing failover with two servers and have hit a situation where
> replication has stopped completely, although the master otherwise works properly.
> When the backup attempts to connect, the master logs the following:
> {code:java}
> 2020-09-01 11:23:31,177 WARN  [org.apache.activemq.artemis.core.server] AMQ222013: Error when trying to start replication: ActiveMQInternalErrorException[errorType=INTERNAL_ERROR message=AMQ229006: journals are not JournalImpl. You can''t set a replicator!]
>     at org.apache.activemq.artemis.core.persistence.impl.journal.JournalStorageManager.startReplication(JournalStorageManager.java:628) [artemis-server-2.11.0.jar:2.11.0]
>     at org.apache.activemq.artemis.core.server.impl.SharedNothingLiveActivation$2.run(SharedNothingLiveActivation.java:178) [artemis-server-2.11.0.jar:2.11.0]
>     at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_201]
> {code}
> How can this happen? Until this point, replication had worked properly
> through all of the failovers (10+).
> I have attached the log files from both systems.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@activemq.apache.org
For additional commands, e-mail: issues-h...@activemq.apache.org
For further information, visit: https://activemq.apache.org/contact




[jira] [Work logged] (ARTEMIS-2892) Replication stopped working after many failovers

2024-06-13 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARTEMIS-2892?focusedWorklogId=923384&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-923384
 ]

ASF GitHub Bot logged work on ARTEMIS-2892:
---

Author: ASF GitHub Bot
Created on: 13/Jun/24 20:44
Start Date: 13/Jun/24 20:44
Worklog Time Spent: 10m 
  Work Description: clebertsuconic opened a new pull request, #4978:
URL: https://github.com/apache/activemq-artemis/pull/4978

   (no comment)




Issue Time Tracking
---

Worklog Id: (was: 923384)
Remaining Estimate: 0h
Time Spent: 10m



