Exept from the linux version you are running, did you tune your disk IO 
buffers for host and guest, and are you using pvscsi?

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2053145

We walked onwards this in another occasion.


On Tuesday, February 24, 2015 at 1:50:40 PM UTC+1, sun...@sunner.com wrote:
>
> Hello,
>
> With the release of 1.0 we've started moving towards a new cluster of GL 
> hosts. These are working very well, with one exception.
> For some reason any reasonably significant UDP traffic will choke the 
> message processor, fill up and process buffers on all four hosts, and 
> effectively choke up all other message processing as well.
> Normally we do around 2k messages per second, split roughly 50/50 between 
> TCP and UDP. Sending the entire TCP load to one host doesn't present a 
> problem, it doesn't break a sweat.
>
> I've also experimented a little with sending a large text file using 
> rsyslog's imfile module, sending it via TCP will bottleneck us at the ES 
> side of things and cause the disk journal fill up fairly rapidly, but it's 
> still working at at ~9k messages per second so that's fine. Sending it via 
> UDP just causes GL to choke again, fill up the journal to a certain point 
> and slowly slowly process the journal at little bursts of a few thousand 
> messages followed by several seconds of apparent sleeping(i.e pretty much 
> no CPU usage).
>
> During all of this the input buffer never fills up more than at most 
> single digit percentages, using TCP the output buffer sometimes moves up to 
> 20-30%, with UDP it never moves at all. It's all in the process buffer. 
> Sending a large burst of messages and then stopping doesn't seem to affect 
> this behavior either, even after the inbound messages stop it still takes a 
> long time to process the messages that are already in the journal and 
> process buffer.
> I'm using VisualVM to look at the CPU and memory usage, this is a 
> screenshot of a UDP session:
> http://i59.tinypic.com/x23xfl.png
>
> I've tried mucking around with various knobs, processbuffer_processors, 
> JVM settings, etc, with no results whatsoever, good or bad.
> There's nothing to suggest a problem in neither the graylog nor system 
> logs.
>
> Pertinent specs and settings:
> ring_size = 16384 (CPU's have 20 MB L3)
> processbuffer_processors = 5
>
> Java 8u31
> Using G1GC with StringDeduplication, I've tried without the latter and 
> just using CMC as well, no difference.
> 4 GB Xmx/Xms.
> Linux 3.16.0
> net.core.rmem_max = 8388608
>
> These are virtual machines, VMware, 8 GB / 8 vCPU's, Xeon E5-2690's.
>
> Software wise the old nodes are running the same setup more or less, 
> except kernel 3.2.0, same JVM, G1GC, etc. Hardware wise, they're physical 
> boxes, old Dell 2950's with dual quad core E5440's. That's Core2 era so 
> quite a bit slower.
>
> Any ideas?
>

-- 
You received this message because you are subscribed to the Google Groups 
"graylog2" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to graylog2+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to