В Срд, 27/01/2010 в 14:14 +0200, Покотиленко Костик пишет:
> В Вто, 26/01/2010 в 20:32 +0200, Покотиленко Костик пишет:
> > В Вто, 26/01/2010 в 09:35 -0800, Duyck, Alexander H пишет:
> > > Покотиленко Костик wrote:
> > > > Hi,
> > > >
> > > > Can somebody investigate please? Bug posted 19.01.2010/
> > > >
> > > > I have tried:
> > > > - 2.6.29 + igb 2.0.6
> > > > - 2.6.30 + igb 2.0.6
> > > > - 2.6.30 + igb 2.1.9
> > > >
> > > > all resulting in deep hang or network down or reboot in 1-20 hours
> > > > randomly.
> > > >
> > > > I have only 3 more variations to try:
> > > > - 2.6.30 + in kernel igb
> > > > - 2.6.32 + in kernel igb
> > > > - 2.6.32 + igb 2.1.9
> > > >
> >
> > Today I switched to 2.6.30 + in kernel igb 1.3.16-k2. Working fine for
> > 6+ hours, as for now. Noticed that it by default use 4 rx-queue and 4
> > tx-queue for each NIC and uses all cores available. 2.0.6 and 2.1.9 used
> > 1 core per NIC by default.
>
> 2.6.30 + in kernel igb 1.3.16-k2, after ~22 hours got this (copied some
> entries before the problem occured):
> ...
This time 5 hours:
- system works fine except for NIC
- this time there is "Detected Tx Unit Hang"
- kern.log attached
- ethtool -t show failure
- removing then inserting igb doesn't help
- Sequential ifconfig with several seconds delay:
#ifconfig eth0; ifconfig eth1
eth0 Link encap:Ethernet HWaddr 00:1b:21:51:9f:88
inet6 addr: fe80::21b:21ff:fe51:9f88/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:74498921 errors:0 dropped:24357 overruns:0 frame:0
TX packets:59194874 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:1382880915 (1.2 GiB) TX bytes:1129626086 (1.0 GiB)
Memory:b3020000-b3040000
eth1 Link encap:Ethernet HWaddr 00:1b:21:51:9f:89
inet6 addr: fe80::21b:21ff:fe51:9f89/64 Scope:Link
UP BROADCAST RUNNING PROMISC MULTICAST MTU:1500 Metric:1
RX packets:59102154 errors:8 dropped:35508 overruns:0 frame:8
TX packets:71590459 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:1036960256 (988.9 MiB) TX bytes:1229891912 (1.1 GiB)
Memory:b3000000-b3020000
#ifconfig eth0; ifconfig eth1
eth0 Link encap:Ethernet HWaddr 00:1b:21:51:9f:88
inet6 addr: fe80::21b:21ff:fe51:9f88/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:74498921 errors:0 dropped:24518 overruns:0 frame:0
TX packets:59194874 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:1382880915 (1.2 GiB) TX bytes:1129626086 (1.0 GiB)
Memory:b3020000-b3040000
eth1 Link encap:Ethernet HWaddr 00:1b:21:51:9f:89
inet6 addr: fe80::21b:21ff:fe51:9f89/64 Scope:Link
UP BROADCAST RUNNING PROMISC MULTICAST MTU:1500 Metric:1
RX packets:59102154 errors:8 dropped:35776 overruns:0 frame:8
TX packets:71590459 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:1036960256 (988.9 MiB) TX bytes:1229891912 (1.1 GiB)
Memory:b3000000-b3020000
- Sequential interrupts with several seconds delay:
# cat /proc/interrupts
CPU0 CPU1 CPU2 CPU3
0: 70 0 0 64 IO-APIC-edge
timer
3: 0 0 0 7 IO-APIC-edge
serial
4: 0 0 0 7193 IO-APIC-edge
serial
8: 0 0 0 63 IO-APIC-edge
rtc0
9: 0 0 0 0 IO-APIC-fasteoi
acpi
14: 0 0 0 0 IO-APIC-edge
ide0
15: 0 0 0 0 IO-APIC-edge
ide1
18: 0 0 0 45377 IO-APIC-fasteoi
ata_piix
21: 138 0 0 0 IO-APIC-fasteoi
ehci_hcd:usb1
22: 0 0 0 0 IO-APIC-fasteoi
ata_piix
23: 69 0 0 0 IO-APIC-fasteoi
ehci_hcd:usb2
24: 61918337 0 0 0 HPET_MSI-edge
hpet2
25: 0 41192452 0 0 HPET_MSI-edge
hpet3
26: 0 0 24796679 0 HPET_MSI-edge
hpet4
27: 0 0 0 807808 HPET_MSI-edge
hpet5
29: 0 0 0 0 PCI-MSI-edge
aerdrv
34: 0 0 0 0 PCI-MSI-edge
aerdrv
35: 0 0 0 0 PCI-MSI-edge
aerdrv
36: 39554468 0 0 0 PCI-MSI-edge
eth0-tx-0
37: 0 134 0 0 PCI-MSI-edge
eth0-tx-1
38: 0 20 0 0 PCI-MSI-edge
eth0-tx-2
39: 0 0 38 0 PCI-MSI-edge
eth0-tx-3
40: 0 0 17094068 0 PCI-MSI-edge
eth0-rx-0
41: 15895281 0 0 0 PCI-MSI-edge
eth0-rx-1
42: 17773413 0 0 0 PCI-MSI-edge
eth0-rx-2
43: 0 14889821 0 0 PCI-MSI-edge
eth0-rx-3
44: 0 6 0 0 PCI-MSI-edge
eth0
45: 0 0 44323277 0 PCI-MSI-edge
eth1-tx-0
46: 0 0 5 0 PCI-MSI-edge
eth1-tx-1
47: 0 0 0 2738 PCI-MSI-edge
eth1-tx-2
48: 0 0 0 90923 PCI-MSI-edge
eth1-tx-3
49: 13837440 0 0 0 PCI-MSI-edge
eth1-rx-0
50: 13562083 0 0 0 PCI-MSI-edge
eth1-rx-1
51: 0 13507931 0 0 PCI-MSI-edge
eth1-rx-2
52: 0 14586531 0 0 PCI-MSI-edge
eth1-rx-3
53: 0 0 12 0 PCI-MSI-edge
eth1
NMI: 0 0 0 0 Non-maskable
interrupts
LOC: 182 446 370 293 Local timer
interrupts
SPU: 0 0 0 0 Spurious interrupts
RES: 291985 259886 204801 920 Rescheduling
interrupts
CAL: 190 296 258 177 Function call
interrupts
TLB: 14406 21304 23222 22280 TLB shootdowns
TRM: 0 0 0 0 Thermal event
interrupts
ERR: 0
MIS: 0
# cat /proc/interrupts
CPU0 CPU1 CPU2 CPU3
0: 70 0 0 64 IO-APIC-edge
timer
3: 0 0 0 7 IO-APIC-edge
serial
4: 0 0 0 7417 IO-APIC-edge
serial
8: 0 0 0 63 IO-APIC-edge
rtc0
9: 0 0 0 0 IO-APIC-fasteoi
acpi
14: 0 0 0 0 IO-APIC-edge
ide0
15: 0 0 0 0 IO-APIC-edge
ide1
18: 0 0 0 45381 IO-APIC-fasteoi
ata_piix
21: 138 0 0 0 IO-APIC-fasteoi
ehci_hcd:usb1
22: 0 0 0 0 IO-APIC-fasteoi
ata_piix
23: 69 0 0 0 IO-APIC-fasteoi
ehci_hcd:usb2
24: 61918662 0 0 0 HPET_MSI-edge
hpet2
25: 0 41192577 0 0 HPET_MSI-edge
hpet3
26: 0 0 24796744 0 HPET_MSI-edge
hpet4
27: 0 0 0 807967 HPET_MSI-edge
hpet5
29: 0 0 0 0 PCI-MSI-edge
aerdrv
34: 0 0 0 0 PCI-MSI-edge
aerdrv
35: 0 0 0 0 PCI-MSI-edge
aerdrv
36: 39554468 0 0 0 PCI-MSI-edge
eth0-tx-0
37: 0 134 0 0 PCI-MSI-edge
eth0-tx-1
38: 0 20 0 0 PCI-MSI-edge
eth0-tx-2
39: 0 0 38 0 PCI-MSI-edge
eth0-tx-3
40: 0 0 17094071 0 PCI-MSI-edge
eth0-rx-0
41: 15895284 0 0 0 PCI-MSI-edge
eth0-rx-1
42: 17773416 0 0 0 PCI-MSI-edge
eth0-rx-2
43: 0 14889824 0 0 PCI-MSI-edge
eth0-rx-3
44: 0 6 0 0 PCI-MSI-edge
eth0
45: 0 0 44323277 0 PCI-MSI-edge
eth1-tx-0
46: 0 0 5 0 PCI-MSI-edge
eth1-tx-1
47: 0 0 0 2738 PCI-MSI-edge
eth1-tx-2
48: 0 0 0 90923 PCI-MSI-edge
eth1-tx-3
49: 13837444 0 0 0 PCI-MSI-edge
eth1-rx-0
50: 13562087 0 0 0 PCI-MSI-edge
eth1-rx-1
51: 0 13507935 0 0 PCI-MSI-edge
eth1-rx-2
52: 0 14586535 0 0 PCI-MSI-edge
eth1-rx-3
53: 0 0 12 0 PCI-MSI-edge
eth1
NMI: 0 0 0 0 Non-maskable
interrupts
LOC: 182 446 370 293 Local timer
interrupts
SPU: 0 0 0 0 Spurious interrupts
RES: 291985 259886 204801 920 Rescheduling
interrupts
CAL: 190 296 258 177 Function call
interrupts
TLB: 14408 21304 23222 22281 TLB shootdowns
TRM: 0 0 0 0 Thermal event
interrupts
ERR: 0
MIS: 0
- Also, with 2.6.30 I have problem bringing interfaces down (at reboot
time also) like this:
unregister_netdevice: waiting for eth1.11 to become free. Usage count =
11
every ~10 seconds. Usage count may be >100 decreasing slowly which can
make it 20 minutes to reboot. I've read on the net this is known problem
resolved in 2.6.32 related to vlans.
- Tommorow going to try 2.6.32 + inkernel igb
--
Покотиленко Костик <[email protected]>
------------------------------------------------------------------------------
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired