[ 
https://issues.apache.org/jira/browse/ARTEMIS-807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15587643#comment-15587643
 ] 

Daniel Lindberg commented on ARTEMIS-807:
-----------------------------------------

Sorry, but I fail to understand this answer. 

I got this exception on the original master during failback, not failover 
(semantics I know, but just to be clear). 
As far as I know, I cannot control when a failback happens. I simply start the 
original master, the current master (original slave) will detect this, and 
initiate a failback to the original master. 
The initial failover happened without any issues as far as I can tell.

I don't see how I can 'wait' for the sync to be complete before initiating a 
failback, since I'm not in control of this process. 


> "Error on writing data! File not opened code - 6" on failback
> -------------------------------------------------------------
>
>                 Key: ARTEMIS-807
>                 URL: https://issues.apache.org/jira/browse/ARTEMIS-807
>             Project: ActiveMQ Artemis
>          Issue Type: Bug
>          Components: Broker
>    Affects Versions: 1.4.0
>            Reporter: Daniel Lindberg
>
> We are running Artemis 1.4.0 on RHEL 7.2 using a master/slave setup using 
> replication (one master and one slave). We did some failover/failback testing 
> while having light load on the broker (15 messages/second). The failover 
> worked without issues and the flow of messages was uninterupted. 
> However on failback we got several exceptions, and eventually ended up in a 
> state were both master and backup were down, resuling in our application 
> failing.
> I haven't been able to track down the meaning of "File not opened code - 6", 
> but this exception was repeated before we saw 
> "ActiveMQIllegalStateException[errorType=ILLEGAL_STATE message=AMQ119026: 
> Backup Server was not yet in sync with live]"
> Stack trace below:
> {noformat}
> 14:07:23,987 WARN  [org.apache.activemq.artemis.journal] AMQ142027: Error on 
> writing data! File not opened code - 6: java.lang.Exception: File not opened
>         at 
> org.apache.activemq.artemis.core.io.DummyCallback.onError(DummyCallback.java:36)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.AbstractSequentialFile$DelegateCallback.onError(AbstractSequentialFile.java:296)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.nio.NIOSequentialFile.internalWrite(NIOSequentialFile.java:307)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.nio.NIOSequentialFile.writeDirect(NIOSequentialFile.java:277)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.AbstractSequentialFile$LocalBufferObserver.flushBuffer(AbstractSequentialFile.java:324)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.buffer.TimedBuffer.flush(TimedBuffer.java:290)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.buffer.TimedBuffer.flush(TimedBuffer.java:262)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.io.AbstractSequentialFileFactory.deactivateBuffer(AbstractSequentialFileFactory.java:156)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.journal.impl.JournalImpl.stop(JournalImpl.java:2121)
>  [artemis-journal-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.persistence.impl.journal.JournalStorageManager.stop(JournalStorageManager.java:215)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.persistence.impl.journal.JournalStorageManager.stop(JournalStorageManager.java:157)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.replication.ReplicationEndpoint.stop(ReplicationEndpoint.java:339)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.server.impl.ActiveMQServerImpl.stopComponent(ActiveMQServerImpl.java:1038)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.server.impl.SharedNothingBackupActivation.run(SharedNothingBackupActivation.java:254)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.server.impl.ActiveMQServerImpl$ActivationThread.run(ActiveMQServerImpl.java:2412)
>  [artemis-server-1.4.0.jar:1.4.0]
>  
> 14:07:24,005 ERROR [org.apache.activemq.artemis.core.server] AMQ224000: 
> Failure in initialisation: 
> ActiveMQIllegalStateException[errorType=ILLEGAL_STATE message=AMQ119026: 
> Backup Server was not yet in sync with live]
>         at 
> org.apache.activemq.artemis.core.server.impl.SharedNothingBackupActivation.run(SharedNothingBackupActivation.java:310)
>  [artemis-server-1.4.0.jar:1.4.0]
>         at 
> org.apache.activemq.artemis.core.server.impl.ActiveMQServerImpl$ActivationThread.run(ActiveMQServerImpl.java:2412)
>  [artemis-server-1.4.0.jar:1.4.0]
>  
> 14:09:13,332 WARN  [org.apache.activemq.artemis.core.client] AMQ212004: 
> Failed to connect to server.
> 14:09:13,343 INFO  [org.apache.activemq.artemis.core.server] AMQ221002: 
> Apache ActiveMQ Artemis Message Broker version 1.4.0 
> [d8756440-9521-11e6-b058-005056be0eea] stopped, uptime 2 minutes
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to