Can you please describe the system HW called out below? Nothing is coming up when I search for "Allnet FW 8888". You are also getting dropped packets (rx_missed_errors). Are those also happening on the 82572 devices? The 82541 are PCI devices and the 82572 are PCIe. We'll know a bit more with a description of the system HW.
Cheers, John ----------------------------------------------------------- "...that your people will judge you on what you can build, not what you destroy.", B. Obama, 2009 >-----Original Message----- >From: Lars Ehrhardt [mailto:[email protected]] >Sent: Sunday, March 14, 2010 3:57 AM >To: [email protected] >Subject: [E1000-devel] Network stalls with e1000 driver and >82541 network chips > >Hello, > >like some have reported before, we also have problems with the e1000 >driver and the 82541GI network chips. During the day we have complete >network outages with those chips ranging from some seconds up to a >minute. We have tried different kernels (Base OS is Debian Lenny i686) >and we have tried 2.6.26, 2.6.30 (backports) and 2.6.32 (backports) >kernel images. I have also compiled the latest 8.0.19-NAPI driver for >2.6.30 but the problems persist. We have disabled all offloading >settings for tests but that did not change anything as well. >Problems do >not seem to be related to system load or high network traffic, it just >happens randomly. We are seeing this behaviour on different Allnet FW >8888 devices which are equipped with two 82541 ports and three 82572EI >chips. The problems are limited to the 82541 ports, if we switch to the >82572EI ports everything works flawlessly. > >I am happy to provide more information. > >The following entry appeared in our dmesg today: > >[57731.000029] ------------[ cut here ]------------ >[57731.000057] WARNING: at >/build/buildd/linux-2.6-2.6.30/debian/build/source_i386_none/ne >t/sched/sch_generic.c:226 >dev_watchdog+0xa8/0x135() >[57731.000071] Hardware name: 945GSE >[57731.000079] NETDEV WATCHDOG: aur-mgt (e1000): transmit timed out >[57731.000087] Modules linked in: e1000 tun sit tunnel4 act_police >cls_flow cls_fw cls_u32 sch_htb sch_hfsc sch_ingress sch_sfq xt_time >xt_connlimit xt_realm iptable_raw xt_comment xt_recent xt_policy >ipt_ULOG ipt_REJECT ipt_REDIRECT ipt_NETMAP ipt_MASQUERADE ipt_ECN >ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic >nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc ppp_deflate >nf_nat_h323 zlib_deflate nf_nat_ftp nf_nat_amanda ts_kmp >nf_conntrack_amanda nf_conntrack_sane nf_conntrack_tftp >nf_conntrack_sip >nf_conntrack_proto_sctp bsd_comp nf_conntrack_pptp >nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns >nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss >xt_pkttype >xt_physdev xt_owner xt_NFQUEUE xt_NFLOG nfnetlink_log xt_multiport >xt_MARK xt_mark xt_mac xt_limit xt_length xt_iprange xt_helper >xt_hashlimit xt_DSCP xt_dscp xt_dccp xt_conntrack xt_CONNMARK >xt_connmark xt_CLASSIFY ipt_LOG xt_tcpudp xt_state iptable_nat nf_nat >nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack iptable_mangle nfnetlink >iptable_filter ip_tables x_tables ppp_async crc_ccitt ppp_generic slhc >8021q garp stp loop arc4 ecb evdev ath9k mac80211 rfkill cfg80211 >led_class parport_pc parport snd_pcm i2c_i801 snd_timer >processor button >i2c_core serio_raw snd soundcore snd_page_alloc rng_core intel_agp >agpgart pcspkr ext3 jbd mbcache dm_mirror dm_region_hash dm_log >dm_snapshot dm_mod raid1 md_mod sd_mod crc_t10dif piix ide_pci_generic >ide_core ata_generic ahci libata uhci_hcd ehci_hcd scsi_mod usbcore >e1000e thermal fan thermal_sys [last unloaded: e1000] >[57731.000536] Pid: 0, comm: swapper Not tainted 2.6.30-bpo.2-686 #1 >[57731.000545] Call Trace: >[57731.000564] [<c0127160>] ? warn_slowpath_common+0x5e/0x8a >[57731.000580] [<c029f614>] ? dev_watchdog+0x0/0x135 >[57731.000594] [<c01271be>] ? warn_slowpath_fmt+0x26/0x2a >[57731.000608] [<c029f6bc>] ? dev_watchdog+0xa8/0x135 >[57731.000625] [<c012e6bf>] ? lock_timer_base+0x19/0x35 >[57731.000639] [<c012e7a4>] ? __mod_timer+0xc9/0xd3 >[57731.000654] [<c012e517>] ? run_timer_softirq+0x13d/0x19d >[57731.000668] [<c029f614>] ? dev_watchdog+0x0/0x135 >[57731.000685] [<c012b25b>] ? __do_softirq+0x8e/0x135 >[57731.000700] [<c012b330>] ? do_softirq+0x2e/0x38 >[57731.000715] [<c012b413>] ? irq_exit+0x26/0x53 >[57731.000730] [<c01105b6>] ? smp_apic_timer_interrupt+0x6c/0x76 >[57731.000747] [<c0103946>] ? apic_timer_interrupt+0x2a/0x30 >[57731.000762] [<c013007b>] ? flush_signals+0xe/0x4e >[57731.000777] [<c0108a4d>] ? mwait_idle+0x67/0x83 >[57731.000792] [<c010237a>] ? cpu_idle+0x46/0x60 >[57731.000809] [<c044476d>] ? start_kernel+0x2c2/0x2c5 >[57731.000819] ---[ end trace c09cf38d283cbbf6 ]--- > >gw:~# ethtool -i aur-mgt >driver: e1000 >version: 8.0.19-NAPI >firmware-version: N/A >bus-info: 0000:05:0b.0 > >gw:~# ethtool -S aur-mgt >NIC statistics: > rx_packets: 24551123 > tx_packets: 43793850 > rx_bytes: 2087115200 > tx_bytes: 62061689856 > rx_broadcast: 11452 > tx_broadcast: 275 > rx_multicast: 26143 > tx_multicast: 7925 > rx_errors: 0 > tx_errors: 0 > tx_dropped: 0 > multicast: 26143 > collisions: 0 > rx_length_errors: 0 > rx_over_errors: 0 > rx_crc_errors: 0 > rx_frame_errors: 0 > rx_no_buffer_count: 446 > rx_missed_errors: 1762 > tx_aborted_errors: 0 > tx_carrier_errors: 0 > tx_fifo_errors: 0 > tx_heartbeat_errors: 0 > tx_window_errors: 0 > tx_abort_late_coll: 0 > tx_deferred_ok: 0 > tx_single_coll_ok: 0 > tx_multi_coll_ok: 0 > tx_timeout_count: 1 > tx_restart_queue: 0 > rx_long_length_errors: 0 > rx_short_length_errors: 0 > rx_align_errors: 0 > tx_tcp_seg_good: 13136 > tx_tcp_seg_failed: 0 > rx_flow_control_xon: 0 > rx_flow_control_xoff: 0 > tx_flow_control_xon: 0 > tx_flow_control_xoff: 0 > rx_long_byte_count: 2087115200 > rx_csum_offload_good: 23278564 > rx_csum_offload_errors: 1 > alloc_rx_buff_failed: 0 > tx_smbus: 0 > rx_smbus: 0 > dropped_smbus: 0 > > > >gw:~# ethtool -e aur-mgt > >Offset Values >------ ------ >0x0000 00 0f c9 04 db 81 10 02 ff ff 00 10 ff ff ff ff >0x0010 ff ff ff ff 0b 64 76 10 86 80 76 10 86 80 84 b2 >0x0020 dd 20 22 22 00 00 90 2f 80 23 12 00 20 1e 12 00 >0x0030 20 1e 12 00 20 1e 12 00 20 1e 09 00 00 02 00 00 >0x0040 0c 00 a6 93 0b 28 00 00 00 0c ff ff ff ff ff ff >0x0050 ff ff ff ff ff ff ff ff ff ff ff ff ff ff 02 06 >0x0060 00 01 00 40 10 12 07 40 ff ff ff ff ff ff ff ff >0x0070 ff ff ff ff ff ff ff ff ff ff ff ff ff ff e3 68 > >gw:~# cat /proc/interrupts > CPU0 CPU1 > 0: 219 0 IO-APIC-edge timer > 1: 2 0 IO-APIC-edge i8042 > 3: 57597 0 IO-APIC-edge serial > 4: 11 0 IO-APIC-edge > 7: 0 0 IO-APIC-edge parport0 > 8: 56 0 IO-APIC-edge rtc0 > 9: 0 0 IO-APIC-fasteoi acpi > 14: 0 0 IO-APIC-edge ide0 > 16: 49 0 IO-APIC-fasteoi uhci_hcd:usb4, ath > 18: 67195406 0 IO-APIC-fasteoi uhci_hcd:usb3, aur-mgt > 19: 51743 0 IO-APIC-fasteoi uhci_hcd:usb2, gst > 23: 0 0 IO-APIC-fasteoi uhci_hcd:usb1, >ehci_hcd:usb5 > 28: 48717 0 PCI-MSI-edge pbr-Q0 > 29: 2 0 PCI-MSI-edge pbr > 30: 4223924 0 PCI-MSI-edge dmz-Q0 > 31: 158 0 PCI-MSI-edge dmz > 32: 58847077 0 PCI-MSI-edge inet-Q0 > 33: 245438 0 PCI-MSI-edge inet > 35: 354469 0 PCI-MSI-edge ahci > NMI: 0 0 Non-maskable interrupts > LOC: 6729669 6541747 Local timer interrupts > SPU: 0 0 Spurious interrupts > RES: 33615 25991 Rescheduling interrupts > CAL: 64 239 Function call interrupts > TLB: 7703 4053 TLB shootdowns > TRM: 0 0 Thermal event interrupts > ERR: 0 > MIS: 0 > > >gw:~# lspci -vvv > >see: http://pastebin.com/MwniCbET > >Cheers, > >Lars > > > >--------------------------------------------------------------- >--------------- >Download Intel® Parallel Studio Eval >Try the new software tools for yourself. Speed compiling, find bugs >proactively, and fine-tune applications for parallel performance. >See why Intel Parallel Studio got high marks during beta. >http://p.sf.net/sfu/intel-sw-dev >_______________________________________________ >E1000-devel mailing list >[email protected] >https://lists.sourceforge.net/lists/listinfo/e1000-devel >To learn more about Intel® Ethernet, visit >http://communities.intel.com/community/wired > ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ E1000-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/e1000-devel To learn more about Intel® Ethernet, visit http://communities.intel.com/community/wired
