Hello,
As others have reported before, we are also having problems with the
e1000 driver and the 82541GI network chips. During the day we see
complete network outages on those chips, lasting from a few seconds up
to a minute. The base OS is Debian Lenny (i686), and we have tried the
2.6.26, 2.6.30 (backports) and 2.6.32 (backports) kernel images. I have
also compiled the latest 8.0.19-NAPI driver for 2.6.30, but the problems
persist. For testing we disabled all offloading settings, but that did
not change anything either. The problems do not seem to be related to
system load or high network traffic; they just happen randomly. We are
seeing this behaviour on several Allnet FW 8888 devices, which are
equipped with two 82541 ports and three 82572EI chips. The problems are
limited to the 82541 ports; if we switch to the 82572EI ports,
everything works flawlessly.
I am happy to provide more information.
The following entry appeared in our dmesg today:
[57731.000029] ------------[ cut here ]------------
[57731.000057] WARNING: at
/build/buildd/linux-2.6-2.6.30/debian/build/source_i386_none/net/sched/sch_generic.c:226
dev_watchdog+0xa8/0x135()
[57731.000071] Hardware name: 945GSE
[57731.000079] NETDEV WATCHDOG: aur-mgt (e1000): transmit timed out
[57731.000087] Modules linked in: e1000 tun sit tunnel4 act_police
cls_flow cls_fw cls_u32 sch_htb sch_hfsc sch_ingress sch_sfq xt_time
xt_connlimit xt_realm iptable_raw xt_comment xt_recent xt_policy
ipt_ULOG ipt_REJECT ipt_REDIRECT ipt_NETMAP ipt_MASQUERADE ipt_ECN
ipt_ecn ipt_CLUSTERIP ipt_ah ipt_addrtype nf_nat_tftp nf_nat_snmp_basic
nf_nat_sip nf_nat_pptp nf_nat_proto_gre nf_nat_irc ppp_deflate
nf_nat_h323 zlib_deflate nf_nat_ftp nf_nat_amanda ts_kmp
nf_conntrack_amanda nf_conntrack_sane nf_conntrack_tftp nf_conntrack_sip
nf_conntrack_proto_sctp bsd_comp nf_conntrack_pptp
nf_conntrack_proto_gre nf_conntrack_netlink nf_conntrack_netbios_ns
nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp xt_tcpmss xt_pkttype
xt_physdev xt_owner xt_NFQUEUE xt_NFLOG nfnetlink_log xt_multiport
xt_MARK xt_mark xt_mac xt_limit xt_length xt_iprange xt_helper
xt_hashlimit xt_DSCP xt_dscp xt_dccp xt_conntrack xt_CONNMARK
xt_connmark xt_CLASSIFY ipt_LOG xt_tcpudp xt_state iptable_nat nf_nat
nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack iptable_mangle nfnetlink
iptable_filter ip_tables x_tables ppp_async crc_ccitt ppp_generic slhc
8021q garp stp loop arc4 ecb evdev ath9k mac80211 rfkill cfg80211
led_class parport_pc parport snd_pcm i2c_i801 snd_timer processor button
i2c_core serio_raw snd soundcore snd_page_alloc rng_core intel_agp
agpgart pcspkr ext3 jbd mbcache dm_mirror dm_region_hash dm_log
dm_snapshot dm_mod raid1 md_mod sd_mod crc_t10dif piix ide_pci_generic
ide_core ata_generic ahci libata uhci_hcd ehci_hcd scsi_mod usbcore
e1000e thermal fan thermal_sys [last unloaded: e1000]
[57731.000536] Pid: 0, comm: swapper Not tainted 2.6.30-bpo.2-686 #1
[57731.000545] Call Trace:
[57731.000564] [<c0127160>] ? warn_slowpath_common+0x5e/0x8a
[57731.000580] [<c029f614>] ? dev_watchdog+0x0/0x135
[57731.000594] [<c01271be>] ? warn_slowpath_fmt+0x26/0x2a
[57731.000608] [<c029f6bc>] ? dev_watchdog+0xa8/0x135
[57731.000625] [<c012e6bf>] ? lock_timer_base+0x19/0x35
[57731.000639] [<c012e7a4>] ? __mod_timer+0xc9/0xd3
[57731.000654] [<c012e517>] ? run_timer_softirq+0x13d/0x19d
[57731.000668] [<c029f614>] ? dev_watchdog+0x0/0x135
[57731.000685] [<c012b25b>] ? __do_softirq+0x8e/0x135
[57731.000700] [<c012b330>] ? do_softirq+0x2e/0x38
[57731.000715] [<c012b413>] ? irq_exit+0x26/0x53
[57731.000730] [<c01105b6>] ? smp_apic_timer_interrupt+0x6c/0x76
[57731.000747] [<c0103946>] ? apic_timer_interrupt+0x2a/0x30
[57731.000762] [<c013007b>] ? flush_signals+0xe/0x4e
[57731.000777] [<c0108a4d>] ? mwait_idle+0x67/0x83
[57731.000792] [<c010237a>] ? cpu_idle+0x46/0x60
[57731.000809] [<c044476d>] ? start_kernel+0x2c2/0x2c5
[57731.000819] ---[ end trace c09cf38d283cbbf6 ]---
gw:~# ethtool -i aur-mgt
driver: e1000
version: 8.0.19-NAPI
firmware-version: N/A
bus-info: 0000:05:0b.0
gw:~# ethtool -S aur-mgt
NIC statistics:
rx_packets: 24551123
tx_packets: 43793850
rx_bytes: 2087115200
tx_bytes: 62061689856
rx_broadcast: 11452
tx_broadcast: 275
rx_multicast: 26143
tx_multicast: 7925
rx_errors: 0
tx_errors: 0
tx_dropped: 0
multicast: 26143
collisions: 0
rx_length_errors: 0
rx_over_errors: 0
rx_crc_errors: 0
rx_frame_errors: 0
rx_no_buffer_count: 446
rx_missed_errors: 1762
tx_aborted_errors: 0
tx_carrier_errors: 0
tx_fifo_errors: 0
tx_heartbeat_errors: 0
tx_window_errors: 0
tx_abort_late_coll: 0
tx_deferred_ok: 0
tx_single_coll_ok: 0
tx_multi_coll_ok: 0
tx_timeout_count: 1
tx_restart_queue: 0
rx_long_length_errors: 0
rx_short_length_errors: 0
rx_align_errors: 0
tx_tcp_seg_good: 13136
tx_tcp_seg_failed: 0
rx_flow_control_xon: 0
rx_flow_control_xoff: 0
tx_flow_control_xon: 0
tx_flow_control_xoff: 0
rx_long_byte_count: 2087115200
rx_csum_offload_good: 23278564
rx_csum_offload_errors: 1
alloc_rx_buff_failed: 0
tx_smbus: 0
rx_smbus: 0
dropped_smbus: 0
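To make the counters above easier to scan, here is a minimal sketch in Python that filters `ethtool -S` output down to the non-zero counters whose names suggest a fault. The function name and the keyword list are assumptions chosen for illustration; they are not defined by the driver or by ethtool.

```python
def nonzero_error_counters(ethtool_output):
    """Return {name: value} for non-zero counters whose name hints at a fault.

    The keyword list is an assumption made for this sketch, not an
    official classification of e1000 statistics.
    """
    keywords = ("error", "missed", "no_buffer", "timeout", "failed", "dropped")
    result = {}
    for line in ethtool_output.splitlines():
        name, sep, value = line.partition(":")
        name, value = name.strip(), value.strip()
        # Skip the "NIC statistics:" header and anything that is not a counter.
        if not sep or not value.isdigit():
            continue
        if int(value) != 0 and any(k in name for k in keywords):
            result[name] = int(value)
    return result


# A few lines taken from the output above.
sample = """NIC statistics:
rx_errors: 0
rx_no_buffer_count: 446
rx_missed_errors: 1762
tx_timeout_count: 1
rx_csum_offload_errors: 1
tx_packets: 43793850
"""

for counter, count in nonzero_error_counters(sample).items():
    print(counter, count)
```

Run against the full output, this would leave rx_no_buffer_count, rx_missed_errors, tx_timeout_count and rx_csum_offload_errors as the only non-zero entries.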
gw:~# ethtool -e aur-mgt
Offset Values
------ ------
0x0000 00 0f c9 04 db 81 10 02 ff ff 00 10 ff ff ff ff
0x0010 ff ff ff ff 0b 64 76 10 86 80 76 10 86 80 84 b2
0x0020 dd 20 22 22 00 00 90 2f 80 23 12 00 20 1e 12 00
0x0030 20 1e 12 00 20 1e 12 00 20 1e 09 00 00 02 00 00
0x0040 0c 00 a6 93 0b 28 00 00 00 0c ff ff ff ff ff ff
0x0050 ff ff ff ff ff ff ff ff ff ff ff ff ff ff 02 06
0x0060 00 01 00 40 10 12 07 40 ff ff ff ff ff ff ff ff
0x0070 ff ff ff ff ff ff ff ff ff ff ff ff ff ff e3 68
gw:~# cat /proc/interrupts
           CPU0       CPU1
  0:        219          0   IO-APIC-edge      timer
  1:          2          0   IO-APIC-edge      i8042
  3:      57597          0   IO-APIC-edge      serial
  4:         11          0   IO-APIC-edge
  7:          0          0   IO-APIC-edge      parport0
  8:         56          0   IO-APIC-edge      rtc0
  9:          0          0   IO-APIC-fasteoi   acpi
 14:          0          0   IO-APIC-edge      ide0
 16:         49          0   IO-APIC-fasteoi   uhci_hcd:usb4, ath
 18:   67195406          0   IO-APIC-fasteoi   uhci_hcd:usb3, aur-mgt
 19:      51743          0   IO-APIC-fasteoi   uhci_hcd:usb2, gst
 23:          0          0   IO-APIC-fasteoi   uhci_hcd:usb1, ehci_hcd:usb5
 28:      48717          0   PCI-MSI-edge      pbr-Q0
 29:          2          0   PCI-MSI-edge      pbr
 30:    4223924          0   PCI-MSI-edge      dmz-Q0
 31:        158          0   PCI-MSI-edge      dmz
 32:   58847077          0   PCI-MSI-edge      inet-Q0
 33:     245438          0   PCI-MSI-edge      inet
 35:     354469          0   PCI-MSI-edge      ahci
NMI:          0          0   Non-maskable interrupts
LOC:    6729669    6541747   Local timer interrupts
SPU:          0          0   Spurious interrupts
RES:      33615      25991   Rescheduling interrupts
CAL:         64        239   Function call interrupts
TLB:       7703       4053   TLB shootdowns
TRM:          0          0   Thermal event interrupts
ERR:          0
MIS:          0
gw:~# lspci -vvv
see: http://pastebin.com/MwniCbET
Cheers,
Lars
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel