Hi, On Thu, Oct 25, 2007 at 12:59:14PM +0200, Lars Marowsky-Bree wrote: > Hi all, > > on my 7 node cluster, I see the occasional - every 5-10 tests - bunch of > messages dropped during a burst; usually on the DC (what a surprise), on > the order of ~200 messages dropped per incident. > > This occurs only with debug 1, and only above >5 nodes or so. > > So yes, my cluster is fully virtualized. However, the physical host has > 8 x 2.66 Ghz cores; the guests don't write the messages to their own > image, but relay it via syslog-ng to the host, where it gets "written" > to a RAM disk, so no IO bottleneck. Each guest essentially has 1 core to > itself + 512MB RAM. > > The network is fully virtual, so I can't be hitting that limit.
Probably your xen is better than mine. Here I have a transfer rate (guest to host) at times around 10mbit. > syslog-ng is running with a fifosize of 40000 lines, and I upped logd to > 2048 sendqlen/recvqlen. > > As a data point: I was experiencing the very same drop message rate and > doubled the buffers on syslog-ng and logd then; no change. > > Any suggestions? Remember this one: http://lists.linux-ha.org/pipermail/linux-ha-dev/2007-April/014378.html Cheers, Dejan > > Regards, > Lars > > -- > Teamlead Kernel, SuSE Labs, Research and Development > SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg) > "Experience is the name everyone gives to their mistakes." -- Oscar Wilde > > _______________________________________________________ > Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org > http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev > Home Page: http://linux-ha.org/ _______________________________________________________ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/