Can you please describe the system HW called out below?  Nothing is coming up 
when I search for "Allnet FW 8888".  You are also getting dropped packets 
(rx_missed_errors).  Are those also happening on the 82572 devices?  The  82541 
are PCI devices and the 82572 are PCIe.  We'll know a bit more with a 
description of the system HW.

Cheers,
John
-----------------------------------------------------------
"...that your people will judge you on what you can build, not what you 
destroy.", B. Obama, 2009 

 

>-----Original Message-----
>From: Lars Ehrhardt [mailto:[email protected]] 
>Sent: Sunday, March 14, 2010 3:57 AM
>To: [email protected]
>Subject: [E1000-devel] Network stalls with e1000 driver and 
>82541 network chips
>
>Hello,
>
>like some have reported before, we also have problems with the e1000
>driver and the 82541GI network chips. During the day we have complete
>network outages with those chips ranging from some seconds up to a
>minute. We have tried different kernels (Base OS is Debian Lenny i686)
>and we have tried 2.6.26, 2.6.30 (backports) and 2.6.32 (backports)
>kernel images. I have also compiled the latest 8.0.19-NAPI driver for
>2.6.30 but the problems persist. We have disabled all offloading
>settings for tests but that did not change anything as well. 
>Problems do
>not seem to be related to system load or high network traffic, it just
>happens randomly. We are seeing this behaviour on different Allnet FW
>8888 devices which are equipped with two 82541 ports and three 82572EI
>chips. The problems are limited to the 82541 ports, if we switch to the
>82572EI ports everything works flawlessly.
>
>I am happy to provide more information.
>
>The following entry appeared in our dmesg today:
>
>[57731.000029] ------------[ cut here ]------------
>[57731.000057] WARNING: at
>/build/buildd/linux-2.6-2.6.30/debian/build/source_i386_none/ne
>t/sched/sch_generic.c:226
>dev_watchdog+0xa8/0x135()
>[57731.000071] Hardware name: 945GSE
>[57731.000079] NETDEV WATCHDOG: aur-mgt (e1000): transmit timed out
>[57731.000087] Modules linked in: e1000 tun sit tunnel4 act_police
>cls_flow cls_fw cls_u32 sch_htb sch_hfsc sch_ingress sch_sfq xt_time
>xt_connlimit xt_realm iptable_raw xt_comment xt_recent xt_policy
>ipt_ULOG ipt_REJECT ipt_REDIRECT ipt_NETMAP ipt_MASQUERADE ipt_ECN
>ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic
>nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc ppp_deflate
>nf_nat_h323 zlib_deflate nf_nat_ftp nf_nat_amanda ts_kmp
>nf_conntrack_amanda nf_conntrack_sane nf_conntrack_tftp 
>nf_conntrack_sip
>nf_conntrack_proto_sctp bsd_comp nf_conntrack_pptp
>nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns
>nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss 
>xt_pkttype
>xt_physdev xt_owner xt_NFQUEUE xt_NFLOG nfnetlink_log xt_multiport
>xt_MARK xt_mark xt_mac xt_limit xt_length xt_iprange xt_helper
>xt_hashlimit xt_DSCP xt_dscp xt_dccp xt_conntrack xt_CONNMARK
>xt_connmark xt_CLASSIFY ipt_LOG xt_tcpudp xt_state iptable_nat nf_nat
>nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack iptable_mangle nfnetlink
>iptable_filter ip_tables x_tables ppp_async crc_ccitt ppp_generic slhc
>8021q garp stp loop arc4 ecb evdev ath9k mac80211 rfkill cfg80211
>led_class parport_pc parport snd_pcm i2c_i801 snd_timer 
>processor button
>i2c_core serio_raw snd soundcore snd_page_alloc rng_core intel_agp
>agpgart pcspkr ext3 jbd mbcache dm_mirror dm_region_hash dm_log
>dm_snapshot dm_mod raid1 md_mod sd_mod crc_t10dif piix ide_pci_generic
>ide_core ata_generic ahci libata uhci_hcd ehci_hcd scsi_mod usbcore
>e1000e thermal fan thermal_sys [last unloaded: e1000]
>[57731.000536] Pid: 0, comm: swapper Not tainted 2.6.30-bpo.2-686 #1
>[57731.000545] Call Trace:
>[57731.000564]  [<c0127160>] ? warn_slowpath_common+0x5e/0x8a
>[57731.000580]  [<c029f614>] ? dev_watchdog+0x0/0x135
>[57731.000594]  [<c01271be>] ? warn_slowpath_fmt+0x26/0x2a
>[57731.000608]  [<c029f6bc>] ? dev_watchdog+0xa8/0x135
>[57731.000625]  [<c012e6bf>] ? lock_timer_base+0x19/0x35
>[57731.000639]  [<c012e7a4>] ? __mod_timer+0xc9/0xd3
>[57731.000654]  [<c012e517>] ? run_timer_softirq+0x13d/0x19d
>[57731.000668]  [<c029f614>] ? dev_watchdog+0x0/0x135
>[57731.000685]  [<c012b25b>] ? __do_softirq+0x8e/0x135
>[57731.000700]  [<c012b330>] ? do_softirq+0x2e/0x38
>[57731.000715]  [<c012b413>] ? irq_exit+0x26/0x53
>[57731.000730]  [<c01105b6>] ? smp_apic_timer_interrupt+0x6c/0x76
>[57731.000747]  [<c0103946>] ? apic_timer_interrupt+0x2a/0x30
>[57731.000762]  [<c013007b>] ? flush_signals+0xe/0x4e
>[57731.000777]  [<c0108a4d>] ? mwait_idle+0x67/0x83
>[57731.000792]  [<c010237a>] ? cpu_idle+0x46/0x60
>[57731.000809]  [<c044476d>] ? start_kernel+0x2c2/0x2c5
>[57731.000819] ---[ end trace c09cf38d283cbbf6 ]---
>
>gw:~# ethtool -i aur-mgt
>driver: e1000
>version: 8.0.19-NAPI
>firmware-version: N/A
>bus-info: 0000:05:0b.0
>
>gw:~# ethtool -S aur-mgt
>NIC statistics:
>      rx_packets: 24551123
>      tx_packets: 43793850
>      rx_bytes: 2087115200
>      tx_bytes: 62061689856
>      rx_broadcast: 11452
>      tx_broadcast: 275
>      rx_multicast: 26143
>      tx_multicast: 7925
>      rx_errors: 0
>      tx_errors: 0
>      tx_dropped: 0
>      multicast: 26143
>      collisions: 0
>      rx_length_errors: 0
>      rx_over_errors: 0
>      rx_crc_errors: 0
>      rx_frame_errors: 0
>      rx_no_buffer_count: 446
>      rx_missed_errors: 1762
>      tx_aborted_errors: 0
>      tx_carrier_errors: 0
>      tx_fifo_errors: 0
>      tx_heartbeat_errors: 0
>      tx_window_errors: 0
>      tx_abort_late_coll: 0
>      tx_deferred_ok: 0
>      tx_single_coll_ok: 0
>      tx_multi_coll_ok: 0
>      tx_timeout_count: 1
>      tx_restart_queue: 0
>      rx_long_length_errors: 0
>      rx_short_length_errors: 0
>      rx_align_errors: 0
>      tx_tcp_seg_good: 13136
>      tx_tcp_seg_failed: 0
>      rx_flow_control_xon: 0
>      rx_flow_control_xoff: 0
>      tx_flow_control_xon: 0
>      tx_flow_control_xoff: 0
>      rx_long_byte_count: 2087115200
>      rx_csum_offload_good: 23278564
>      rx_csum_offload_errors: 1
>      alloc_rx_buff_failed: 0
>      tx_smbus: 0
>      rx_smbus: 0
>      dropped_smbus: 0
>
>
>
>gw:~# ethtool -e aur-mgt
>
>Offset          Values
>------          ------
>0x0000          00 0f c9 04 db 81 10 02 ff ff 00 10 ff ff ff ff
>0x0010          ff ff ff ff 0b 64 76 10 86 80 76 10 86 80 84 b2
>0x0020          dd 20 22 22 00 00 90 2f 80 23 12 00 20 1e 12 00
>0x0030          20 1e 12 00 20 1e 12 00 20 1e 09 00 00 02 00 00
>0x0040          0c 00 a6 93 0b 28 00 00 00 0c ff ff ff ff ff ff
>0x0050          ff ff ff ff ff ff ff ff ff ff ff ff ff ff 02 06
>0x0060          00 01 00 40 10 12 07 40 ff ff ff ff ff ff ff ff
>0x0070          ff ff ff ff ff ff ff ff ff ff ff ff ff ff e3 68
>
>gw:~# cat /proc/interrupts
>             CPU0       CPU1
>    0:        219          0   IO-APIC-edge      timer
>    1:          2          0   IO-APIC-edge      i8042
>    3:      57597          0   IO-APIC-edge      serial
>    4:         11          0   IO-APIC-edge
>    7:          0          0   IO-APIC-edge      parport0
>    8:         56          0   IO-APIC-edge      rtc0
>    9:          0          0   IO-APIC-fasteoi   acpi
>   14:          0          0   IO-APIC-edge      ide0
>   16:         49          0   IO-APIC-fasteoi   uhci_hcd:usb4, ath
>   18:   67195406          0   IO-APIC-fasteoi   uhci_hcd:usb3, aur-mgt
>   19:      51743          0   IO-APIC-fasteoi   uhci_hcd:usb2, gst
>   23:          0          0   IO-APIC-fasteoi   uhci_hcd:usb1,
>ehci_hcd:usb5
>   28:      48717          0   PCI-MSI-edge      pbr-Q0
>   29:          2          0   PCI-MSI-edge      pbr
>   30:    4223924          0   PCI-MSI-edge      dmz-Q0
>   31:        158          0   PCI-MSI-edge      dmz
>   32:   58847077          0   PCI-MSI-edge      inet-Q0
>   33:     245438          0   PCI-MSI-edge      inet
>   35:     354469          0   PCI-MSI-edge      ahci
>  NMI:          0          0   Non-maskable interrupts
>  LOC:    6729669    6541747   Local timer interrupts
>  SPU:          0          0   Spurious interrupts
>  RES:      33615      25991   Rescheduling interrupts
>  CAL:         64        239   Function call interrupts
>  TLB:       7703       4053   TLB shootdowns
>  TRM:          0          0   Thermal event interrupts
>  ERR:          0
>  MIS:          0
>
>
>gw:~# lspci -vvv
>
>see: http://pastebin.com/MwniCbET
>
>Cheers,
>
>Lars
>
>
>
>---------------------------------------------------------------
>---------------
>Download Intel&#174; Parallel Studio Eval
>Try the new software tools for yourself. Speed compiling, find bugs
>proactively, and fine-tune applications for parallel performance.
>See why Intel Parallel Studio got high marks during beta.
>http://p.sf.net/sfu/intel-sw-dev
>_______________________________________________
>E1000-devel mailing list
>[email protected]
>https://lists.sourceforge.net/lists/listinfo/e1000-devel
>To learn more about Intel&#174; Ethernet, visit 
>http://communities.intel.com/community/wired
>
------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to