Re: em 6.6.6 - watchdog timeout
Hi Jack, --On Thursday, November 01, 2007 13:36 -0700 Jack Vogel [EMAIL PROTECTED] wrote: I should also note that this only applies to PCI-E NICs, 82571 and later. Jack Have tested - enable MSI with the original 6.6.6 driver - the new driver files you sent to -stable with and without MSI enabled In all cases I can run all tests and programs that previous gave watchdog problems without any problems. Thanks! /glz ... the future isMobile Goran Lowkrantz [EMAIL PROTECTED] System Architect, isMobile AB Sandviksgatan 81, PO Box 58, S-971 03 LuleƄ, Sweden Mobile: +46(0)70-587 87 82 http://www.ismobile.com ... ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
I should also note that this only applies to PCI-E NICs, 82571 and later. Jack ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
On 10/21/07, Mike Tancsa [EMAIL PROTECTED] wrote: At 12:10 AM 10/21/2007, Mike Andrews wrote: I haven't tried the 6.6.6 driver on mine yet, though, so this could be something totally different. I was going to bump one of them from RELENG_6 to RELENG_7 as a test soon. I see this problem running RELENG_6, which has the 6.6.6 driver. I forget the exact supermicro model # Timecounter i8254 frequency 1193182 Hz quality 0 CPU: Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz (2402.50-MHz 686-class CPU) Origin = GenuineIntel Id = 0x6f6 Stepping = 6 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM AMD Features=0x2010NX,LM AMD Features2=0x1LAHF Cores per package: 2 real memory = 2144329728 (2044 MB) avail memory = 2092859392 (1995 MB) ACPI APIC Table: INTEL S3000AHX FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0: Changing APIC ID to 5 ioapic0 Version 2.0 irqs 0-23 on motherboard ioapic1 Version 2.0 irqs 30-53 on motherboard kbd1 at kbdmux0 acpi0: INTEL S3000AHX on motherboard em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x3020-0x303f mem 0x8826-0x8827,0x8824-0x8825 irq 16 at device 0.0 on pci1 em0: Ethernet address: 00:15:17:12:f6:04 em0: [FAST] em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x3000-0x301f mem 0x8822-0x8823,0x8820-0x8821 irq 17 at device 0.1 on pci1 em1: Ethernet address: 00:15:17:12:f6:05 em1: [FAST] em2: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x2000-0x201f mem 0x8818-0x8819,0x8810-0x8817 irq 17 at device 0.0 on pci5 em2: Ethernet address: 00:15:17:29:6f:ef em2: [FAST] em3: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x1100-0x113f mem 0x8802-0x8803,0x8800-0x8801 irq 17 at device 5.0 on pci6 em3: Ethernet address: 00:15:17:29:6f:f0 em3: [FAST] I already ran the dos util to fix the eeprom, but no difference. I would like you all to try using MSI interrupts, watchdogs don't happen when I do this. If you have hardware that has a system issue with MSI then ignore this, but these SuperMicros systems should be fine. First, you must enable it on the system: sysctl hw.pci.enable_msi=1 Then you must reload the driver. If you use em static in the kernel you will have to change the loader.conf to enable msi on boot. I am going to add a display that will tell you when an adapter uses MSI or MSI/X next time I check in code. Not only does this solve my watchdog problems, I also find on the UDP_STREAM test of netperf that I get better performance when using MSI. Let me know how it works if you try this. Cheers, Jack ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
Jeremy Chadwick wrote: On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote: me too on a Supermicro 5015MT+, although I notice my em0 is also sharing an interrupt with USB (uhci3)... not sure if that's the culprit. I'm not aware of a 5015MT+ model. Maybe you mean 5015M-MT+ or 5015M-T+? We've got five 5015M-MT+'s in production. The PDSMI+ motherboard has two 82573's, one of which has a known EEPROM issue that Jack Vogel has sent a DOS-based utility around to fix up (search the archives for dcgdis.exe). It fixed watchdog timeouts I was having way back in the 6.2 beta cycle -- in my case they only happened when I had link at 1G and went away if I dropped to 100M. It might be worth ruling that out first unless you've already done that. :) I haven't tried the 6.6.6 driver on mine yet, though, so this could be something totally different. I was going to bump one of them from RELENG_6 to RELENG_7 as a test soon. ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
At 08:32 PM 10/19/2007, Jack Vogel wrote: OK, I will look into this as soon as I can. Just a Same here I think. The time outs dont correlate with load. The one nic is also seeing a lot of overruns. Also, I checked the eeprom and its not an issue it seems Oct 21 14:24:39 c1 kernel: Interface EEPROM Dump: Oct 21 14:24:39 c1 kernel: Offset Oct 21 14:24:39 c1 kernel: 0x 1500 1217 04f6 0420 5062 Oct 21 14:24:39 c1 kernel: 0x0010 d508 6801 a42f 115e 8086 105e 8086 b165 Oct 21 14:24:39 c1 kernel: 0x0020 0008 105e 5400 5001 0100 Oct 21 14:24:39 c1 kernel: 0x0030 6cf6 37b0 07a6 8403 0783 c303 0602 [EMAIL PROTECTED]:0:0: class=0x02 card=0x115e8086 chip=0x105e8086 rev=0x06 hdr=0x00 vendor = 'Intel Corporation' device = 'PRO/1000 PT' class = network subclass = ethernet cap 01[c8] = powerspec 2 supports D0 D3 current D0 cap 05[d0] = MSI supports 1 message, 64 bit cap 10[e0] = PCI-Express 1 endpoint [EMAIL PROTECTED]:0:1: class=0x02 card=0x115e8086 chip=0x105e8086 rev=0x06 hdr=0x00 vendor = 'Intel Corporation' device = 'PRO/1000 PT' class = network subclass = ethernet cap 01[c8] = powerspec 2 supports D0 D3 current D0 cap 05[d0] = MSI supports 1 message, 64 bit cap 10[e0] = PCI-Express 1 endpoint [EMAIL PROTECTED]:0:0: class=0x02 card=0x348d8086 chip=0x108c8086 rev=0x03 hdr=0x00 vendor = 'Intel Corporation' device = 'PRO/1000 PM' class = network subclass = ethernet cap 01[c8] = powerspec 2 supports D0 D3 current D0 cap 05[d0] = MSI supports 1 message, 64 bit cap 10[e0] = PCI-Express 1 endpoint [EMAIL PROTECTED]:5:0: class=0x02 card=0x348d8086 chip=0x10768086 rev=0x05 hdr=0x00 vendor = 'Intel Corporation' device = '82547EI Gigabit Ethernet Controller' class = network subclass = ethernet cap 01[dc] = powerspec 2 supports D0 D3 current D0 cap 07[e4] = PCI-X supports 2048 burst read, 1 split transaction Oct 21 02:15:01 c1 kernel: em1: Excessive collisions = 0 Oct 21 02:15:01 c1 kernel: em1: Sequence errors = 0 Oct 21 02:15:01 c1 kernel: em1: Defer count = 0 Oct 21 02:15:01 c1 kernel: em1: Missed Packets = 0 Oct 21 02:15:01 c1 kernel: em1: Receive No Buffers = 0 Oct 21 02:15:01 c1 kernel: em1: Receive Length Errors = 0 Oct 21 02:15:01 c1 kernel: em1: Receive errors = 0 Oct 21 02:15:01 c1 kernel: em1: Crc errors = 0 Oct 21 02:15:01 c1 kernel: em1: Alignment errors = 0 Oct 21 02:15:01 c1 kernel: em1: Collision/Carrier extension errors = 0 Oct 21 02:15:01 c1 kernel: em1: RX overruns = 0 Oct 21 02:15:01 c1 kernel: em1: watchdog timeouts = 2 Oct 21 02:15:01 c1 kernel: em1: XON Rcvd = 0 Oct 21 02:15:01 c1 kernel: em1: XON Xmtd = 0 Oct 21 02:15:01 c1 kernel: em1: XOFF Rcvd = 0 Oct 21 02:15:01 c1 kernel: em1: XOFF Xmtd = 0 Oct 21 02:15:01 c1 kernel: em1: Good Packets Rcvd = 972845 Oct 21 02:15:01 c1 kernel: em1: Good Packets Xmtd = 1056492 Oct 21 02:15:01 c1 kernel: em1: TSO Contexts Xmtd = 0 Oct 21 02:15:01 c1 kernel: em1: TSO Contexts Failed = 0 Oct 21 02:15:02 c1 kernel: em2: Excessive collisions = 0 Oct 21 02:15:02 c1 kernel: em2: Sequence errors = 0 Oct 21 02:15:02 c1 kernel: em2: Defer count = 0 Oct 21 02:15:02 c1 kernel: em2: Missed Packets = 146876 Oct 21 02:15:02 c1 kernel: em2: Receive No Buffers = 24633 Oct 21 02:15:02 c1 kernel: em2: Receive Length Errors = 0 Oct 21 02:15:02 c1 kernel: em2: Receive errors = 0 Oct 21 02:15:02 c1 kernel: em2: Crc errors = 0 Oct 21 02:15:02 c1 kernel: em2: Alignment errors = 0 Oct 21 02:15:02 c1 kernel: em2: Collision/Carrier extension errors = 0 Oct 21 02:15:02 c1 kernel: em2: RX overruns = 1261 Oct 21 02:15:02 c1 kernel: em2: watchdog timeouts = 0 Oct 21 02:15:02 c1 kernel: em2: XON Rcvd = 0 Oct 21 02:15:02 c1 kernel: em2: XON Xmtd = 0 Oct 21 02:15:02 c1 kernel: em2: XOFF Rcvd = 0 Oct 21 02:15:02 c1 kernel: em2: XOFF Xmtd = 0 Oct 21 02:15:02 c1 kernel: em2: Good Packets Rcvd = 11520947867 Oct 21 02:15:02 c1 kernel: em2: Good Packets Xmtd = 6732398306 Oct 21 02:15:02 c1 kernel: em2: TSO Contexts Xmtd = 0 Oct 21 02:15:02 c1 kernel: em2: TSO Contexts Failed = 0 # vmstat -i interrupt total rate irq1: atkbd0 5 0 irq4: sio0 39518 0 irq16: em0 935164706 1022 irq17: em1 em2 em3 802572934877 irq19: atapci1179422 0 cpu0: timer 1828072218 1997 cpu1: timer 1828031524 1997 Total 5394060327 5895 Jack On 10/19/07, Philip Murray [EMAIL PROTECTED] wrote: On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote: Hi, After the update of em to 6.6.6 last, I experience watchdog timeouts on a server running 6-STABLE. me too on a Supermicro 5015MT+, although I notice my em0 is also
Re: em 6.6.6 - watchdog timeout
At 12:10 AM 10/21/2007, Mike Andrews wrote: I haven't tried the 6.6.6 driver on mine yet, though, so this could be something totally different. I was going to bump one of them from RELENG_6 to RELENG_7 as a test soon. I see this problem running RELENG_6, which has the 6.6.6 driver. I forget the exact supermicro model # Timecounter i8254 frequency 1193182 Hz quality 0 CPU: Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz (2402.50-MHz 686-class CPU) Origin = GenuineIntel Id = 0x6f6 Stepping = 6 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM AMD Features=0x2010NX,LM AMD Features2=0x1LAHF Cores per package: 2 real memory = 2144329728 (2044 MB) avail memory = 2092859392 (1995 MB) ACPI APIC Table: INTEL S3000AHX FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0: Changing APIC ID to 5 ioapic0 Version 2.0 irqs 0-23 on motherboard ioapic1 Version 2.0 irqs 30-53 on motherboard kbd1 at kbdmux0 acpi0: INTEL S3000AHX on motherboard em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x3020-0x303f mem 0x8826-0x8827,0x8824-0x8825 irq 16 at device 0.0 on pci1 em0: Ethernet address: 00:15:17:12:f6:04 em0: [FAST] em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x3000-0x301f mem 0x8822-0x8823,0x8820-0x8821 irq 17 at device 0.1 on pci1 em1: Ethernet address: 00:15:17:12:f6:05 em1: [FAST] em2: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x2000-0x201f mem 0x8818-0x8819,0x8810-0x8817 irq 17 at device 0.0 on pci5 em2: Ethernet address: 00:15:17:29:6f:ef em2: [FAST] em3: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x1100-0x113f mem 0x8802-0x8803,0x8800-0x8801 irq 17 at device 5.0 on pci6 em3: Ethernet address: 00:15:17:29:6f:f0 em3: [FAST] I already ran the dos util to fix the eeprom, but no difference. ---Mike ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
[EMAIL PROTECTED] wrote: Hi, snip When running netstat between servers balder and midgard, server balder get watchdog timeouts and resets the connection for a few seconds. Oct 19 13:12:47 balder kernel: em0: watchdog timeout -- resetting s/netstat/netperf/ ... the future isMobile Goran Lowkrantz [EMAIL PROTECTED] System Architect, iaMobile AB Sandviksgatan 81, PO Box 58, S-971 03 LuleƄ, Sweden Mobile: +46(0)70-587 87 82 http://www.ismobile.com ... ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
OK, I will look into this as soon as I can. Jack On 10/19/07, Philip Murray [EMAIL PROTECTED] wrote: On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote: Hi, After the update of em to 6.6.6 last, I experience watchdog timeouts on a server running 6-STABLE. me too on a Supermicro 5015MT+, although I notice my em0 is also sharing an interrupt with USB (uhci3)... not sure if that's the culprit. [EMAIL PROTECTED] ~]$ dmesg Copyright (c) 1992-2007 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 6.2-STABLE #0: Tue Oct 9 07:45:50 NZDT 2007 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC ACPI APIC Table: PTLTD APIC Timecounter i8254 frequency 1193182 Hz quality 0 CPU: Intel(R) Xeon(R) CPU X3220 @ 2.40GHz (2394.01-MHz 686- class CPU) Origin = GenuineIntel Id = 0x6f7 Stepping = 7 Features = 0xbfebfbff FPU ,VME ,DE ,PSE ,TSC ,MSR ,PAE ,MCE ,CX8 ,APIC ,SEP ,MTRR ,PGE ,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM AMD Features=0x2010NX,LM AMD Features2=0x1LAHF Cores per package: 4 real memory = 2146304000 (2046 MB) avail memory = 2095353856 (1998 MB) ioapic0 Version 2.0 irqs 0-23 on motherboard ioapic1 Version 2.0 irqs 24-47 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) acpi0: PTLTD RSDT on motherboard acpi0: Power Button (fixed) Timecounter ACPI-fast frequency 3579545 Hz quality 1000 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0 cpu0: ACPI CPU on acpi0 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0 pci0: ACPI PCI bus on pcib0 pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0 pci1: ACPI PCI bus on pcib1 pcib2: ACPI PCI-PCI bridge irq 17 at device 28.0 on pci0 pci9: ACPI PCI bus on pcib2 pcib3: ACPI PCI-PCI bridge at device 0.0 on pci9 pci10: ACPI PCI bus on pcib3 pcib4: PCI-PCI bridge at device 1.0 on pci10 pci11: PCI bus on pcib4 arcmsr0: Areca SATA Host Adapter RAID Controller mem 0xe020-0xe0200fff,0xe080-0xe0bf irq 26 at device 14.0 on pci11 ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05 ARECA RAID ADAPTER0: FIRMWARE VERSION V1.42 2006-10-13 pcib5: ACPI PCI-PCI bridge irq 17 at device 28.4 on pci0 pci13: ACPI PCI bus on pcib5 em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x4000-0x401f mem 0xe030-0xe031 irq 16 at device 0.0 on pci13 em0: Ethernet address: 00:30:48:90:48:dc em0: [FAST] pcib6: ACPI PCI-PCI bridge irq 16 at device 28.5 on pci0 pci14: ACPI PCI bus on pcib6 em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x5000-0x501f mem 0xe040-0xe041 irq 17 at device 0.0 on pci14 em1: Ethernet address: 00:30:48:90:48:dd em1: [FAST] uhci0: UHCI (generic) USB controller port 0x3000-0x301f irq 23 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] usb0: UHCI (generic) USB controller on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: UHCI (generic) USB controller port 0x3020-0x303f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] usb1: UHCI (generic) USB controller on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: UHCI (generic) USB controller port 0x3040-0x305f irq 18 at device 29.2 on pci0 uhci2: [GIANT-LOCKED] usb2: UHCI (generic) USB controller on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at device 29.3 on pci0 uhci3: [GIANT-LOCKED] usb3: UHCI (generic) USB controller on uhci3 usb3: USB revision 1.0 uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub3: 2 ports with 2 removable, self powered ehci0: Intel 82801GB/R (ICH7) USB 2.0 controller mem 0xe000-0xe3ff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: Intel 82801GB/R (ICH7) USB 2.0 controller on ehci0 usb4: USB revision 2.0 uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub4: 8 ports with 8 removable, self powered pcib7: ACPI PCI-PCI bridge at device 30.0 on pci0 pci15: ACPI PCI bus on pcib7 pci15: display, VGA at device 0.0 (no driver attached) isab0: PCI-ISA bridge at device 31.0 on pci0 isa0: ISA bus on isab0 atapci0: Intel ICH7 UDMA100 controller port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x30a0-0x30af at device 31.1 on pci0 ata0: ATA channel 0 on atapci0 ata1: ATA channel 1
Re: em 6.6.6 - watchdog timeout
On 20/10/2007, at 5:03 PM, Jeremy Chadwick wrote: On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote: me too on a Supermicro 5015MT+, although I notice my em0 is also sharing an interrupt with USB (uhci3)... not sure if that's the culprit. I'm not aware of a 5015MT+ model. Maybe you mean 5015M-MT+ or 5015M-T+? We have two 5015M-T+ boxes, one running RELENG_7 and one running RELENG_6, and neither exhibit this problem. Here's relevant em(4) information from both boxes; I do find the vmstat -i IRQ on the 2nd box a little odd, but operationally it seems fine. box1 (RELENG_6) === em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x4000-0x401f mem 0xe020-0xe021 irq 16 at device 0.0 on pci13 em1: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x5000-0x501f mem 0xe030-0xe031 irq 17 at device 0.0 on pci14 irq16: em0 uhci3 117371806 11 irq17: em1 49605005 4 box2 (RELENG_7) === em0: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x4000-0x401f mem 0xe800-0xe801 irq 16 at device 0.0 on pci13 em1: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x5000-0x501f mem 0xe820-0xe821 irq 17 at device 0.0 on pci14 irq256: em0 502854 4 irq257: em1 5438 0 On box2, IRQ 16 is also shared with uhci3 and vgapci0, but vmstat doesn't seem to show that. uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at device 29.3 on pci0 vgapci0: VGA-compatible display port 0x6000-0x60ff mem 0xe000-0xe7ff,0xe830-0xe830 irq 16 at device 0.0 on pci15 Hi Jeremy You don't seem to be running the 6.6.6 driver, which is when the watchdog timeouts started occurring. Had no problems with 6.2.9. Cheers Phil ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote: me too on a Supermicro 5015MT+, although I notice my em0 is also sharing an interrupt with USB (uhci3)... not sure if that's the culprit. I'm not aware of a 5015MT+ model. Maybe you mean 5015M-MT+ or 5015M-T+? We have two 5015M-T+ boxes, one running RELENG_7 and one running RELENG_6, and neither exhibit this problem. Here's relevant em(4) information from both boxes; I do find the vmstat -i IRQ on the 2nd box a little odd, but operationally it seems fine. box1 (RELENG_6) === em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x4000-0x401f mem 0xe020-0xe021 irq 16 at device 0.0 on pci13 em1: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x5000-0x501f mem 0xe030-0xe031 irq 17 at device 0.0 on pci14 irq16: em0 uhci3 117371806 11 irq17: em1 49605005 4 box2 (RELENG_7) === em0: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x4000-0x401f mem 0xe800-0xe801 irq 16 at device 0.0 on pci13 em1: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x5000-0x501f mem 0xe820-0xe821 irq 17 at device 0.0 on pci14 irq256: em0 502854 4 irq257: em1 5438 0 On box2, IRQ 16 is also shared with uhci3 and vgapci0, but vmstat doesn't seem to show that. uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at device 29.3 on pci0 vgapci0: VGA-compatible display port 0x6000-0x60ff mem 0xe000-0xe7ff,0xe830-0xe830 irq 16 at device 0.0 on pci15 -- | Jeremy Chadwickjdc at parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to [EMAIL PROTECTED]
Re: em 6.6.6 - watchdog timeout
On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote: Hi, After the update of em to 6.6.6 last, I experience watchdog timeouts on a server running 6-STABLE. me too on a Supermicro 5015MT+, although I notice my em0 is also sharing an interrupt with USB (uhci3)... not sure if that's the culprit. [EMAIL PROTECTED] ~]$ dmesg Copyright (c) 1992-2007 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 6.2-STABLE #0: Tue Oct 9 07:45:50 NZDT 2007 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC ACPI APIC Table: PTLTD APIC Timecounter i8254 frequency 1193182 Hz quality 0 CPU: Intel(R) Xeon(R) CPU X3220 @ 2.40GHz (2394.01-MHz 686- class CPU) Origin = GenuineIntel Id = 0x6f7 Stepping = 7 Features = 0xbfebfbff FPU ,VME ,DE ,PSE ,TSC ,MSR ,PAE ,MCE ,CX8 ,APIC ,SEP ,MTRR ,PGE ,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM AMD Features=0x2010NX,LM AMD Features2=0x1LAHF Cores per package: 4 real memory = 2146304000 (2046 MB) avail memory = 2095353856 (1998 MB) ioapic0 Version 2.0 irqs 0-23 on motherboard ioapic1 Version 2.0 irqs 24-47 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) acpi0: PTLTD RSDT on motherboard acpi0: Power Button (fixed) Timecounter ACPI-fast frequency 3579545 Hz quality 1000 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0 cpu0: ACPI CPU on acpi0 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0 pci0: ACPI PCI bus on pcib0 pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0 pci1: ACPI PCI bus on pcib1 pcib2: ACPI PCI-PCI bridge irq 17 at device 28.0 on pci0 pci9: ACPI PCI bus on pcib2 pcib3: ACPI PCI-PCI bridge at device 0.0 on pci9 pci10: ACPI PCI bus on pcib3 pcib4: PCI-PCI bridge at device 1.0 on pci10 pci11: PCI bus on pcib4 arcmsr0: Areca SATA Host Adapter RAID Controller mem 0xe020-0xe0200fff,0xe080-0xe0bf irq 26 at device 14.0 on pci11 ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05 ARECA RAID ADAPTER0: FIRMWARE VERSION V1.42 2006-10-13 pcib5: ACPI PCI-PCI bridge irq 17 at device 28.4 on pci0 pci13: ACPI PCI bus on pcib5 em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x4000-0x401f mem 0xe030-0xe031 irq 16 at device 0.0 on pci13 em0: Ethernet address: 00:30:48:90:48:dc em0: [FAST] pcib6: ACPI PCI-PCI bridge irq 16 at device 28.5 on pci0 pci14: ACPI PCI bus on pcib6 em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0x5000-0x501f mem 0xe040-0xe041 irq 17 at device 0.0 on pci14 em1: Ethernet address: 00:30:48:90:48:dd em1: [FAST] uhci0: UHCI (generic) USB controller port 0x3000-0x301f irq 23 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] usb0: UHCI (generic) USB controller on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: UHCI (generic) USB controller port 0x3020-0x303f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] usb1: UHCI (generic) USB controller on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: UHCI (generic) USB controller port 0x3040-0x305f irq 18 at device 29.2 on pci0 uhci2: [GIANT-LOCKED] usb2: UHCI (generic) USB controller on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at device 29.3 on pci0 uhci3: [GIANT-LOCKED] usb3: UHCI (generic) USB controller on uhci3 usb3: USB revision 1.0 uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub3: 2 ports with 2 removable, self powered ehci0: Intel 82801GB/R (ICH7) USB 2.0 controller mem 0xe000-0xe3ff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: Intel 82801GB/R (ICH7) USB 2.0 controller on ehci0 usb4: USB revision 2.0 uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub4: 8 ports with 8 removable, self powered pcib7: ACPI PCI-PCI bridge at device 30.0 on pci0 pci15: ACPI PCI bus on pcib7 pci15: display, VGA at device 0.0 (no driver attached) isab0: PCI-ISA bridge at device 31.0 on pci0 isa0: ISA bus on isab0 atapci0: Intel ICH7 UDMA100 controller port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x30a0-0x30af at device 31.1 on pci0 ata0: ATA channel 0 on atapci0 ata1: ATA channel 1 on atapci0 pci0: serial bus, SMBus at device 31.3 (no driver attached) acpi_button0: Power Button on acpi0 sio0: 16550A-compatible COM port port 0x3f8-0x3ff irq 4 flags 0x10 on
em 6.6.6 - watchdog timeout
Hi, After the update of em to 6.6.6 last, I experience watchdog timeouts on a server running 6-STABLE. I have two identical servers with Intel D915GAV boards. Both have Intel PRO/1000 PCI-Express network cards. Server balder: em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0xac00-0xac1f mem 0xff60-0xff61,0xff62-0xff63 irq 16 at device 0.0 on pci5 em0: Ethernet address: 00:1b:21:00:48:c4 em0: [FAST] # vmstat -i interrupt total rate irq1: atkbd0 3 0 irq4: sio0 2 0 irq6: fdc012 0 irq14: ata0 68 0 irq16: em0 uhci3 219828879450 irq19: uhci1++ 4287947 8 irq22: ahc0232717293476 irq23: uhci0 ehci0 1 0 cpu0: timer976552804 2000 Total 1433387009 2935 # netstat -i NameMtu Network Address Ipkts IerrsOpkts Oerrs Coll em01500 Link#1 00:1b:21:00:48:c4 209880531 773 2062284 0 em01500 10.255.253/24 balder215210996 - 212337968 - - plip0 1500 Link#2 0 00 0 0 lo0 16384 Link#312040055 0 12055326 0 0 lo0 16384 fe80:3::1 fe80:3::10 -0 - - lo0 16384 localhost ::1 6 -6 - - lo0 16384 your-net localhost 6249979 - 6249980 - - 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory Controller Hub (rev 04) 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express Root Port (rev 04) 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL Integrated Graphics Controller (rev 04) 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03) 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 2 (rev 03) 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03) 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 4 (rev 03) 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03) 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03) 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03) 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03) 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03) 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3) 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface Bridge (rev 03) 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) IDE Controller (rev 03) 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA Controller (rev 03) 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03) 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet Controller (Copper) (rev 06) 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01) Server midgard: em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0xac00-0xac1f mem 0xff50-0xff51,0xff52-0xff53 irq 16 at device 0.0 on pci5 em0: Ethernet address: 00:15:17:0e:05:f7 [EMAIL PROTECTED] vmstat -i interrupt total rate irq1: atkbd0 11 0 irq4: sio0 2142746 0 irq6: fdc014 0 irq14: ata0 252 0 irq16: em0+40101164 irq19: atapci1+ 7932757 1 irq22: ahc0 87074425 21 cpu0: timer 3807810138937 Total 4571600444 1125 [EMAIL PROTECTED] netstat -i NameMtu Network Address Ipkts IerrsOpkts Oerrs Coll em01500 Link#1 00:15:17:0e:05:f7 343771280 0 474609731 0 0 em01500 10.255.253/24 midgard 347467842 - 478700485 - - plip0 1500 Link#2 0 00 0 0 lo0 16384 Link#316821054 0 16947668 0 0 lo0 16384 fe80:3::1 fe80:3::10 -0 - - lo0 16384 localhost ::1 2610 - 2610 - - lo0 16384 your-net localhost 12616879 - 12616879 - - lo0 16384 10.255.253.12 appsrv1 0 -0 - -
Re: em 6.6.6 - watchdog timeout
[EMAIL PROTECTED] wrote: Hi, After the update of em to 6.6.6 last, I experience watchdog timeouts on a server running 6-STABLE. I have two identical servers with Intel D915GAV boards. Both have Intel PRO/1000 PCI-Express network cards. Server balder: em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 0xac00-0xac1f mem 0xff60-0xff61,0xff62-0xff63 irq 16 at device 0.0 on pci5 em0: Ethernet address: 00:1b:21:00:48:c4 em0: [FAST] # vmstat -i interrupt total rate irq1: atkbd0 3 0 irq4: sio0 2 0 irq6: fdc012 0 irq14: ata0 68 0 irq16: em0 uhci3 219828879450 irq19: uhci1++ 4287947 8 irq22: ahc0232717293476 irq23: uhci0 ehci0 1 0 cpu0: timer976552804 2000 Total 1433387009 2935 # netstat -i NameMtu Network Address Ipkts IerrsOpkts Oerrs Coll em01500 Link#1 00:1b:21:00:48:c4 209880531 773 20622 84 0 em01500 10.255.253/24 balder215210996 - 212337968 - - plip0 1500 Link#2 0 00 0 0 lo0 16384 Link#312040055 0 12055326 0 0 lo0 16384 fe80:3::1 fe80:3::10 -0 - - lo0 16384 localhost ::1 6 -6 - - lo0 16384 your-net localhost 6249979 - 6249980 - - 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory Controller Hub (rev 04) 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express Root Port (rev 04) 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL Integrated Graphics Controller (rev 04) 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03) 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 2 (rev 03) 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03) 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 4 (rev 03) 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03) 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03) 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03) 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03) 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03) 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3) 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface Bridge (rev 03) 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) IDE Controller (rev 03) 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA Controller (rev 03) 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03) 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet Controller (Copper) (rev 06) 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01) Server midgard: em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0xac00-0xac1f mem 0xff50-0xff51,0xff52-0xff53 irq 16 at device 0.0 on pci5 em0: Ethernet address: 00:15:17:0e:05:f7 [EMAIL PROTECTED] vmstat -i interrupt total rate irq1: atkbd0 11 0 irq4: sio0 2142746 0 irq6: fdc014 0 irq14: ata0 252 0 irq16: em0+40101164 irq19: atapci1+ 7932757 1 irq22: ahc0 87074425 21 cpu0: timer 3807810138937 Total 4571600444 1125 [EMAIL PROTECTED] netstat -i NameMtu Network Address Ipkts IerrsOpkts Oerrs Coll em01500 Link#1 00:15:17:0e:05:f7 343771280 0 474609731 0 0 em01500 10.255.253/24 midgard 347467842 - 478700485 - - plip0 1500 Link#2 0 00 0 0 lo0 16384 Link#316821054 0 16947668 0 0 lo0 16384 fe80:3::1 fe80:3::10 -0 - - lo0 16384 localhost ::1 2610 - 2610 - - lo0 16384 your-net localhost 12616879 - 12616879 - - lo0 16384 10.255.253.12 appsrv1 0 -0 - - lo0 16384 10.255.253.10