Hello,

like some have reported before, we also have problems with the e1000
driver and the 82541GI network chips. During the day we have complete
network outages with those chips ranging from some seconds up to a
minute. We have tried different kernels (Base OS is Debian Lenny i686)
and we have tried 2.6.26, 2.6.30 (backports) and 2.6.32 (backports)
kernel images. I have also compiled the latest 8.0.19-NAPI driver for
2.6.30 but the problems persist. We have disabled all offloading
settings for tests but that did not change anything as well. Problems do
not seem to be related to system load or high network traffic, it just
happens randomly. We are seeing this behaviour on different Allnet FW
8888 devices which are equipped with two 82541 ports and three 82572EI
chips. The problems are limited to the 82541 ports, if we switch to the
82572EI ports everything works flawlessly.

I am happy to provide more information.

The following entry appeared in our dmesg today:

[57731.000029] ------------[ cut here ]------------
[57731.000057] WARNING: at
/build/buildd/linux-2.6-2.6.30/debian/build/source_i386_none/net/sched/sch_generic.c:226
dev_watchdog+0xa8/0x135()
[57731.000071] Hardware name: 945GSE
[57731.000079] NETDEV WATCHDOG: aur-mgt (e1000): transmit timed out
[57731.000087] Modules linked in: e1000 tun sit tunnel4 act_police
cls_flow cls_fw cls_u32 sch_htb sch_hfsc sch_ingress sch_sfq xt_time
xt_connlimit xt_realm iptable_raw xt_comment xt_recent xt_policy
ipt_ULOG ipt_REJECT ipt_REDIRECT ipt_NETMAP ipt_MASQUERADE ipt_ECN
ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic
nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc ppp_deflate
nf_nat_h323 zlib_deflate nf_nat_ftp nf_nat_amanda ts_kmp
nf_conntrack_amanda nf_conntrack_sane nf_conntrack_tftp nf_conntrack_sip
nf_conntrack_proto_sctp bsd_comp nf_conntrack_pptp
nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns
nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss xt_pkttype
xt_physdev xt_owner xt_NFQUEUE xt_NFLOG nfnetlink_log xt_multiport
xt_MARK xt_mark xt_mac xt_limit xt_length xt_iprange xt_helper
xt_hashlimit xt_DSCP xt_dscp xt_dccp xt_conntrack xt_CONNMARK
xt_connmark xt_CLASSIFY ipt_LOG xt_tcpudp xt_state iptable_nat nf_nat
nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack iptable_mangle nfnetlink
iptable_filter ip_tables x_tables ppp_async crc_ccitt ppp_generic slhc
8021q garp stp loop arc4 ecb evdev ath9k mac80211 rfkill cfg80211
led_class parport_pc parport snd_pcm i2c_i801 snd_timer processor button
i2c_core serio_raw snd soundcore snd_page_alloc rng_core intel_agp
agpgart pcspkr ext3 jbd mbcache dm_mirror dm_region_hash dm_log
dm_snapshot dm_mod raid1 md_mod sd_mod crc_t10dif piix ide_pci_generic
ide_core ata_generic ahci libata uhci_hcd ehci_hcd scsi_mod usbcore
e1000e thermal fan thermal_sys [last unloaded: e1000]
[57731.000536] Pid: 0, comm: swapper Not tainted 2.6.30-bpo.2-686 #1
[57731.000545] Call Trace:
[57731.000564]  [<c0127160>] ? warn_slowpath_common+0x5e/0x8a
[57731.000580]  [<c029f614>] ? dev_watchdog+0x0/0x135
[57731.000594]  [<c01271be>] ? warn_slowpath_fmt+0x26/0x2a
[57731.000608]  [<c029f6bc>] ? dev_watchdog+0xa8/0x135
[57731.000625]  [<c012e6bf>] ? lock_timer_base+0x19/0x35
[57731.000639]  [<c012e7a4>] ? __mod_timer+0xc9/0xd3
[57731.000654]  [<c012e517>] ? run_timer_softirq+0x13d/0x19d
[57731.000668]  [<c029f614>] ? dev_watchdog+0x0/0x135
[57731.000685]  [<c012b25b>] ? __do_softirq+0x8e/0x135
[57731.000700]  [<c012b330>] ? do_softirq+0x2e/0x38
[57731.000715]  [<c012b413>] ? irq_exit+0x26/0x53
[57731.000730]  [<c01105b6>] ? smp_apic_timer_interrupt+0x6c/0x76
[57731.000747]  [<c0103946>] ? apic_timer_interrupt+0x2a/0x30
[57731.000762]  [<c013007b>] ? flush_signals+0xe/0x4e
[57731.000777]  [<c0108a4d>] ? mwait_idle+0x67/0x83
[57731.000792]  [<c010237a>] ? cpu_idle+0x46/0x60
[57731.000809]  [<c044476d>] ? start_kernel+0x2c2/0x2c5
[57731.000819] ---[ end trace c09cf38d283cbbf6 ]---

gw:~# ethtool -i aur-mgt
driver: e1000
version: 8.0.19-NAPI
firmware-version: N/A
bus-info: 0000:05:0b.0

gw:~# ethtool -S aur-mgt
NIC statistics:
      rx_packets: 24551123
      tx_packets: 43793850
      rx_bytes: 2087115200
      tx_bytes: 62061689856
      rx_broadcast: 11452
      tx_broadcast: 275
      rx_multicast: 26143
      tx_multicast: 7925
      rx_errors: 0
      tx_errors: 0
      tx_dropped: 0
      multicast: 26143
      collisions: 0
      rx_length_errors: 0
      rx_over_errors: 0
      rx_crc_errors: 0
      rx_frame_errors: 0
      rx_no_buffer_count: 446
      rx_missed_errors: 1762
      tx_aborted_errors: 0
      tx_carrier_errors: 0
      tx_fifo_errors: 0
      tx_heartbeat_errors: 0
      tx_window_errors: 0
      tx_abort_late_coll: 0
      tx_deferred_ok: 0
      tx_single_coll_ok: 0
      tx_multi_coll_ok: 0
      tx_timeout_count: 1
      tx_restart_queue: 0
      rx_long_length_errors: 0
      rx_short_length_errors: 0
      rx_align_errors: 0
      tx_tcp_seg_good: 13136
      tx_tcp_seg_failed: 0
      rx_flow_control_xon: 0
      rx_flow_control_xoff: 0
      tx_flow_control_xon: 0
      tx_flow_control_xoff: 0
      rx_long_byte_count: 2087115200
      rx_csum_offload_good: 23278564
      rx_csum_offload_errors: 1
      alloc_rx_buff_failed: 0
      tx_smbus: 0
      rx_smbus: 0
      dropped_smbus: 0



gw:~# ethtool -e aur-mgt

Offset          Values
------          ------
0x0000          00 0f c9 04 db 81 10 02 ff ff 00 10 ff ff ff ff
0x0010          ff ff ff ff 0b 64 76 10 86 80 76 10 86 80 84 b2
0x0020          dd 20 22 22 00 00 90 2f 80 23 12 00 20 1e 12 00
0x0030          20 1e 12 00 20 1e 12 00 20 1e 09 00 00 02 00 00
0x0040          0c 00 a6 93 0b 28 00 00 00 0c ff ff ff ff ff ff
0x0050          ff ff ff ff ff ff ff ff ff ff ff ff ff ff 02 06
0x0060          00 01 00 40 10 12 07 40 ff ff ff ff ff ff ff ff
0x0070          ff ff ff ff ff ff ff ff ff ff ff ff ff ff e3 68

gw:~# cat /proc/interrupts
             CPU0       CPU1
    0:        219          0   IO-APIC-edge      timer
    1:          2          0   IO-APIC-edge      i8042
    3:      57597          0   IO-APIC-edge      serial
    4:         11          0   IO-APIC-edge
    7:          0          0   IO-APIC-edge      parport0
    8:         56          0   IO-APIC-edge      rtc0
    9:          0          0   IO-APIC-fasteoi   acpi
   14:          0          0   IO-APIC-edge      ide0
   16:         49          0   IO-APIC-fasteoi   uhci_hcd:usb4, ath
   18:   67195406          0   IO-APIC-fasteoi   uhci_hcd:usb3, aur-mgt
   19:      51743          0   IO-APIC-fasteoi   uhci_hcd:usb2, gst
   23:          0          0   IO-APIC-fasteoi   uhci_hcd:usb1,
ehci_hcd:usb5
   28:      48717          0   PCI-MSI-edge      pbr-Q0
   29:          2          0   PCI-MSI-edge      pbr
   30:    4223924          0   PCI-MSI-edge      dmz-Q0
   31:        158          0   PCI-MSI-edge      dmz
   32:   58847077          0   PCI-MSI-edge      inet-Q0
   33:     245438          0   PCI-MSI-edge      inet
   35:     354469          0   PCI-MSI-edge      ahci
  NMI:          0          0   Non-maskable interrupts
  LOC:    6729669    6541747   Local timer interrupts
  SPU:          0          0   Spurious interrupts
  RES:      33615      25991   Rescheduling interrupts
  CAL:         64        239   Function call interrupts
  TLB:       7703       4053   TLB shootdowns
  TRM:          0          0   Thermal event interrupts
  ERR:          0
  MIS:          0


gw:~# lspci -vvv

see: http://pastebin.com/MwniCbET

Cheers,

Lars



------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to