[jira] [Commented] (AMQ-4987) io wait on replicated levelDB slaves

anselme dewavrin (JIRA) Thu, 23 Jan 2014 01:54:33 -0800

    [ 
https://issues.apache.org/jira/browse/AMQ-4987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13879805#comment-13879805
 ]


anselme dewavrin commented on AMQ-4987:
---------------------------------------

Dear All,

The iowait and huge disk activity is due to frequent fsyncs on the slaves. I 
demonstrated this by preloading a library that disables fsyncs (with a 
LD_PRELOAD=libeatmydata.so). With this trick the iowait disappear.

I think the leveldb replicated stores is in itself secure enough without 
fsync-ing each update, because of the synchronous replication (sync=quorum_mem) 
on different machines that should fail within the same seconds. This is why the 
many fsyncs on the slaves are not useful, in my opinion.

For my purpose I will live with the LD_PRELOAD, but it could be profitable for 
the community to make the levelDB replicated store evolve.

Anselme

> io wait on replicated levelDB slaves
> ------------------------------------
>
>                 Key: AMQ-4987
>                 URL: https://issues.apache.org/jira/browse/AMQ-4987
>             Project: ActiveMQ
>          Issue Type: Test
>          Components: activemq-leveldb-store
>    Affects Versions: 5.9.0
>         Environment: debian VM 2.6.32-5-amd64, jdk7
>            Reporter: anselme dewavrin
>            Priority: Minor
>
> Dear all,
> I set up a 3-nodes replicatedLevelDB activeMQ cluster as explained on the 
> activemq site.
> I made a message injector using the php stomp client described here : 
> http://stomp.fusesource.org/documentation/php/book.html
> Then I injected persistent messages as fast as possible (giving about 600 
> messages/s).
> Everything works fine, then I measured the servers' activity with "vmstat 1". 
> I saw no iowait on the master node, but  20% on both slaves. This would 
> impeach scalabitity I suppose. And the iowait is justified by 3000 bo/s 
> (blocks out) in the vmstat report.
> The machines are not swapping (paging).
> Here is what I tried, without success :
> -specify explicitly sync="quorum_mem"
> -JNI implementation of the leveldb store (and verified it is used)
> -setting flushDelay to 2000
> Does anyone have an idea that I could try ? Why is the leveldb slaves writing 
> so much to disk ?
> Many thanks in advance
> Yours,
> Anselme



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (AMQ-4987) io wait on replicated levelDB slaves

Reply via email to