err... you are right ... lspci | grep -i eth 05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (rev 12) 09:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (rev 12)
thanks for your reply 2010/4/29 Martin Knoblauch <kn...@knobisoft.de> > Hi, > > can you tell us which NIC you are using (/sbin/lspci) and which version of > the driver? When I wrote that REMOVE_BOGUS_SPIKES hack, it was because of a > HW/FW problem in certain Broadcom devices. It was supposed to be fixed after > kernel 2.6.9. > > The debug output from gmond suggests the overflow coming from the > bytes_out counter (BO). > > And you are right, just lowering the thresholds is not useful in general. > > Cheers > Martin > > > > >From: 左扬 <weichon...@gmail.com> > >To: ganglia-developers@lists.sourceforge.net > >Sent: Wed, April 28, 2010 1:48:58 PM > >Subject: [Ganglia-developers] bogus spikes of network_report, is that a > bug on the kernel? > > > >hello dear all~ > > > >we use the ganglia to generate the network traffic report, > > > >bu because of the bogus spikes up to 400p, I can see nothing...(as the > graph in the attachment, i modified the graph.d/network_report.php, change > the unit from bytes/s to bits/s ) > > > >and I read the code and then made some tests for days > > > > > >in the libmetrics/linux/metrics.c:line 287, there is a switch, so i > re-make ganglia with CFLAGS=DREMOVE_BOGUS_SPIKES, and restart the gmond, > > > >after days, i found there were still spkes (about 4T) > > > >so I have to change the Line 292 from > > > >if ((l_bin > 1.0e13) || (l_bout > 1.0e13) || > > > >to > > > >if ((l_bin > 2.5e8) || (l_bout > 2.5e8) || /* 2Gbps , there are 2 gigabit > NIC on our server) > >> > >to avoid the spikes. > > > >I think that is not a good idea, the others may use the faster NIC, and > then I added some code in the update_ifdata() to log the contents of > '/proc/net/dev '(value of proc_net_dev.buffer) > > > > > >logs from /var/log/message: > >Apr 27 23:19:13 hostname /opt/ganglia/sbin/gmond[18465]: > >update_ifdata(BO) - Overflow in rbo: 304634803029227 -> 630666266 > >[1272381553] > >>Apr 27 23:20:13 hostname /opt/ganglia/sbin/gmond[18465]: > >update_ifdata(BO) - Overflow in rbi: 10458900526801464705 -> > >38016437180368 [1272381613] > >>Apr 27 23:20:13 hostname /opt/ganglia/sbin/gmond[18465]: > >update_ifdata(BO) - Overflow in rpo: 219388676028 -> 219365592250 > >[1272381613] > > > > > >logs for the /proc/net/dev > > > >>------------------ 1272381433.117603 ----------------- > >>Inter-| Receive | > Transmit > >>face |bytes packets errs drop fifo frame compressed multicast|bytes > packets errs drop fifo colls carrier compressed > >>lo:3143390051 39831988 0 0 0 0 0 0 > 3143390051 39831988 0 0 0 0 0 0 > >>tunl0: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > >>eth0:38015520377153 135587033135 0 8587116 0 0 0 > 6 304631801519418 219359254753 0 0 0 0 0 0 > >>eth1: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > > > >>------------------ 1272381493.118502 ----------------- > >>Inter-| Receive | > Transmit > >>face |bytes packets errs drop fifo frame compressed multicast|bytes > packets errs drop fifo colls carrier compressed > >>lo:3143407797 39832216 0 0 0 0 0 0 > 3143407797 39832216 0 0 0 0 0 0 > >>tunl0: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > >>eth0:38015973907827 135588437010 0 8587116 0 0 0 > 6 304634803029227 219361451245 0 0 0 0 0 0 > >>eth1: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > > > >>------------------ 1272381553.121013 ----------------- > >>Inter-| Receive | > Transmit > >>face |bytes packets errs drop fifo frame compressed multicast|bytes > packets errs drop fifo colls carrier compressed > >>lo:3143407797 39832216 0 0 0 0 0 0 > 3143407797 39832216 0 0 0 0 0 0 > >>tunl0: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > >>eth0:10458900526801464705 135564674293 0 8587116 0 0 0 > 219363599555 630666266 219388676028 7723 0 0 0 7723 0 > >>eth1: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > > > >>------------------ 1272381613.123535 ----------------- > >>Inter-| Receive | > Transmit > >>face |bytes packets errs drop fifo frame compressed multicast|bytes > packets errs drop fifo colls carrier compressed > >>lo:3143444605 39832676 0 0 0 0 0 0 > 3143444605 39832676 0 0 0 0 0 0 > >>tunl0: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > >>eth0:38016437180368 135590918375 0 8587116 0 0 0 > 6 304640653909921 219365592250 0 0 0 0 0 0 > >>eth1: 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > > > >the value at 1272381493 is ok, but the value at 1272381553 is abnormal, > and then the value at 1272381613 recovered . > > > >I don't think this is caused by a HW error, it seems a bug on the kernel. > (we're using 2.6.20-pm and 2.6.9-34.ELsmp, both are x86_64) > > > >but i don't know much about the kernel... so is there anyone to confirm ? > > > >thanks. > > > >-- > >墙角数枝梅,凌寒独自开。 > >遥知不是雪,为有暗香来。 > > > -- 墙角数枝梅,凌寒独自开。 遥知不是雪,为有暗香来。
------------------------------------------------------------------------------
_______________________________________________ Ganglia-developers mailing list Ganglia-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-developers