Re: em 6.6.6 - watchdog timeout

2007-11-03 Thread Goran Lowkrantz

Hi Jack,

--On Thursday, November 01, 2007 13:36 -0700 Jack Vogel [EMAIL PROTECTED] 
wrote:



I should also note that this only applies to PCI-E NICs, 82571 and later.

Jack


Have tested
- enable MSI with the original 6.6.6 driver
- the new driver files you sent to -stable with and without MSI enabled

In all cases I can run all tests and programs that previous gave watchdog 
problems without any problems.


Thanks!

/glz



... the future isMobile

 Goran Lowkrantz [EMAIL PROTECTED]
 System Architect, isMobile AB
 Sandviksgatan 81, PO Box 58, S-971 03 LuleƄ, Sweden
 Mobile: +46(0)70-587 87 82
http://www.ismobile.com ...
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-11-01 Thread Jack Vogel
I should also note that this only applies to PCI-E NICs, 82571 and later.

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-11-01 Thread Jack Vogel
On 10/21/07, Mike Tancsa [EMAIL PROTECTED] wrote:
 At 12:10 AM 10/21/2007, Mike Andrews wrote:

 I haven't tried the 6.6.6 driver on mine yet, though, so this could
 be something totally different.  I was going to bump one of them
 from RELENG_6 to RELENG_7 as a test soon.

 I see this problem running RELENG_6, which has the 6.6.6 driver. I
 forget the exact supermicro model #

 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Core(TM)2 CPU  6600  @ 2.40GHz (2402.50-MHz
 686-class CPU)
Origin = GenuineIntel  Id = 0x6f6  Stepping = 6

 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
AMD Features=0x2010NX,LM
AMD Features2=0x1LAHF
Cores per package: 2
 real memory  = 2144329728 (2044 MB)
 avail memory = 2092859392 (1995 MB)
 ACPI APIC Table: INTEL  S3000AHX
 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
   cpu0 (BSP): APIC ID:  0
   cpu1 (AP): APIC ID:  1
 ioapic0: Changing APIC ID to 5
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 ioapic1 Version 2.0 irqs 30-53 on motherboard
 kbd1 at kbdmux0
 acpi0: INTEL S3000AHX on motherboard
 em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x3020-0x303f mem 0x8826-0x8827,0x8824-0x8825 irq 16
 at device 0.0 on pci1
 em0: Ethernet address: 00:15:17:12:f6:04
 em0: [FAST]
 em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x3000-0x301f mem 0x8822-0x8823,0x8820-0x8821 irq 17
 at device 0.1 on pci1
 em1: Ethernet address: 00:15:17:12:f6:05
 em1: [FAST]
 em2: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x2000-0x201f mem 0x8818-0x8819,0x8810-0x8817 irq 17
 at device 0.0 on pci5
 em2: Ethernet address: 00:15:17:29:6f:ef
 em2: [FAST]
 em3: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x1100-0x113f mem 0x8802-0x8803,0x8800-0x8801 irq 17
 at device 5.0 on pci6
 em3: Ethernet address: 00:15:17:29:6f:f0
 em3: [FAST]

 I already ran the dos util to fix the eeprom, but no difference.

I would like you all to try using MSI interrupts, watchdogs don't happen
when I do this. If you have hardware that has a system issue with MSI
then ignore this, but these SuperMicros systems should be fine.

First, you must enable it on the system:

sysctl hw.pci.enable_msi=1

Then you must reload the driver. If you use em static in the kernel you
will have to change the loader.conf to enable msi on boot.

I am going to add a display that will tell you when an adapter uses MSI or MSI/X
next time I check in code.

Not only does this solve my watchdog problems, I also find on the UDP_STREAM
test of netperf that I get better performance when using MSI.

Let me know how it works if you try this.

Cheers,

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-21 Thread Mike Andrews

Jeremy Chadwick wrote:

On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote:
me too on a Supermicro 5015MT+, although I notice my em0 is also sharing 
an interrupt with USB (uhci3)... not sure if that's the culprit.


I'm not aware of a 5015MT+ model.  Maybe you mean 5015M-MT+ or
5015M-T+?


We've got five 5015M-MT+'s in production.  The PDSMI+ motherboard has 
two 82573's, one of which has a known EEPROM issue that Jack Vogel has 
sent a DOS-based utility around to fix up (search the archives for 
dcgdis.exe).  It fixed watchdog timeouts I was having way back in the 
6.2 beta cycle -- in my case they only happened when I had link at 1G 
and went away if I dropped to 100M.  It might be worth ruling that out 
first unless you've already done that. :)


I haven't tried the 6.6.6 driver on mine yet, though, so this could be 
something totally different.  I was going to bump one of them from 
RELENG_6 to RELENG_7 as a test soon.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-21 Thread Mike Tancsa

At 08:32 PM 10/19/2007, Jack Vogel wrote:

OK, I will look into this as soon as I can.



Just a Same here I think. The time outs dont correlate with load. 
The one nic is also seeing a lot of overruns.  Also, I checked the 
eeprom and its not an issue it seems



Oct 21 14:24:39 c1 kernel: Interface EEPROM Dump:
Oct 21 14:24:39 c1 kernel: Offset
Oct 21 14:24:39 c1 kernel: 0x  1500 1217 04f6 0420  5062  
Oct 21 14:24:39 c1 kernel: 0x0010  d508 6801 a42f 115e 8086 105e 8086 b165
Oct 21 14:24:39 c1 kernel: 0x0020  0008 105e 5400  5001   0100
Oct 21 14:24:39 c1 kernel: 0x0030  6cf6 37b0 07a6 8403 0783  c303 0602

[EMAIL PROTECTED]:0:0:   class=0x02 card=0x115e8086 chip=0x105e8086 
rev=0x06 hdr=0x00

vendor = 'Intel Corporation'
device = 'PRO/1000 PT'
class  = network
subclass   = ethernet
cap 01[c8] = powerspec 2  supports D0 D3  current D0
cap 05[d0] = MSI supports 1 message, 64 bit
cap 10[e0] = PCI-Express 1 endpoint
[EMAIL PROTECTED]:0:1:   class=0x02 card=0x115e8086 chip=0x105e8086 
rev=0x06 hdr=0x00

vendor = 'Intel Corporation'
device = 'PRO/1000 PT'
class  = network
subclass   = ethernet
cap 01[c8] = powerspec 2  supports D0 D3  current D0
cap 05[d0] = MSI supports 1 message, 64 bit
cap 10[e0] = PCI-Express 1 endpoint

[EMAIL PROTECTED]:0:0:   class=0x02 card=0x348d8086 chip=0x108c8086 
rev=0x03 hdr=0x00

vendor = 'Intel Corporation'
device = 'PRO/1000 PM'
class  = network
subclass   = ethernet
cap 01[c8] = powerspec 2  supports D0 D3  current D0
cap 05[d0] = MSI supports 1 message, 64 bit
cap 10[e0] = PCI-Express 1 endpoint

[EMAIL PROTECTED]:5:0:   class=0x02 card=0x348d8086 chip=0x10768086 
rev=0x05 hdr=0x00

vendor = 'Intel Corporation'
device = '82547EI Gigabit Ethernet Controller'
class  = network
subclass   = ethernet
cap 01[dc] = powerspec 2  supports D0 D3  current D0
cap 07[e4] = PCI-X supports 2048 burst read, 1 split transaction

Oct 21 02:15:01 c1 kernel: em1: Excessive collisions = 0
Oct 21 02:15:01 c1 kernel: em1: Sequence errors = 0
Oct 21 02:15:01 c1 kernel: em1: Defer count = 0
Oct 21 02:15:01 c1 kernel: em1: Missed Packets = 0
Oct 21 02:15:01 c1 kernel: em1: Receive No Buffers = 0
Oct 21 02:15:01 c1 kernel: em1: Receive Length Errors = 0
Oct 21 02:15:01 c1 kernel: em1: Receive errors = 0
Oct 21 02:15:01 c1 kernel: em1: Crc errors = 0
Oct 21 02:15:01 c1 kernel: em1: Alignment errors = 0
Oct 21 02:15:01 c1 kernel: em1: Collision/Carrier extension errors = 0
Oct 21 02:15:01 c1 kernel: em1: RX overruns = 0
Oct 21 02:15:01 c1 kernel: em1: watchdog timeouts = 2
Oct 21 02:15:01 c1 kernel: em1: XON Rcvd = 0
Oct 21 02:15:01 c1 kernel: em1: XON Xmtd = 0
Oct 21 02:15:01 c1 kernel: em1: XOFF Rcvd = 0
Oct 21 02:15:01 c1 kernel: em1: XOFF Xmtd = 0
Oct 21 02:15:01 c1 kernel: em1: Good Packets Rcvd = 972845
Oct 21 02:15:01 c1 kernel: em1: Good Packets Xmtd = 1056492
Oct 21 02:15:01 c1 kernel: em1: TSO Contexts Xmtd = 0
Oct 21 02:15:01 c1 kernel: em1: TSO Contexts Failed = 0
Oct 21 02:15:02 c1 kernel: em2: Excessive collisions = 0
Oct 21 02:15:02 c1 kernel: em2: Sequence errors = 0
Oct 21 02:15:02 c1 kernel: em2: Defer count = 0
Oct 21 02:15:02 c1 kernel: em2: Missed Packets = 146876
Oct 21 02:15:02 c1 kernel: em2: Receive No Buffers = 24633
Oct 21 02:15:02 c1 kernel: em2: Receive Length Errors = 0
Oct 21 02:15:02 c1 kernel: em2: Receive errors = 0
Oct 21 02:15:02 c1 kernel: em2: Crc errors = 0
Oct 21 02:15:02 c1 kernel: em2: Alignment errors = 0
Oct 21 02:15:02 c1 kernel: em2: Collision/Carrier extension errors = 0
Oct 21 02:15:02 c1 kernel: em2: RX overruns = 1261
Oct 21 02:15:02 c1 kernel: em2: watchdog timeouts = 0
Oct 21 02:15:02 c1 kernel: em2: XON Rcvd = 0
Oct 21 02:15:02 c1 kernel: em2: XON Xmtd = 0
Oct 21 02:15:02 c1 kernel: em2: XOFF Rcvd = 0
Oct 21 02:15:02 c1 kernel: em2: XOFF Xmtd = 0
Oct 21 02:15:02 c1 kernel: em2: Good Packets Rcvd = 11520947867
Oct 21 02:15:02 c1 kernel: em2: Good Packets Xmtd = 6732398306
Oct 21 02:15:02 c1 kernel: em2: TSO Contexts Xmtd = 0
Oct 21 02:15:02 c1 kernel: em2: TSO Contexts Failed = 0

# vmstat -i
interrupt  total   rate
irq1: atkbd0   5  0
irq4: sio0 39518  0
irq16: em0 935164706   1022
irq17: em1 em2 em3 802572934877
irq19: atapci1179422  0
cpu0: timer   1828072218   1997
cpu1: timer   1828031524   1997
Total 5394060327   5895




Jack


On 10/19/07, Philip Murray [EMAIL PROTECTED] wrote:

 On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote:

  Hi,
 
  After the update of em to 6.6.6 last, I experience watchdog timeouts
  on a server running 6-STABLE.
 

 me too on a Supermicro 5015MT+, although I notice my em0 is also
 

Re: em 6.6.6 - watchdog timeout

2007-10-21 Thread Mike Tancsa

At 12:10 AM 10/21/2007, Mike Andrews wrote:

I haven't tried the 6.6.6 driver on mine yet, though, so this could 
be something totally different.  I was going to bump one of them 
from RELENG_6 to RELENG_7 as a test soon.


I see this problem running RELENG_6, which has the 6.6.6 driver. I 
forget the exact supermicro model #


Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Core(TM)2 CPU  6600  @ 2.40GHz (2402.50-MHz 
686-class CPU)

  Origin = GenuineIntel  Id = 0x6f6  Stepping = 6
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
  AMD Features=0x2010NX,LM
  AMD Features2=0x1LAHF
  Cores per package: 2
real memory  = 2144329728 (2044 MB)
avail memory = 2092859392 (1995 MB)
ACPI APIC Table: INTEL  S3000AHX
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
ioapic0: Changing APIC ID to 5
ioapic0 Version 2.0 irqs 0-23 on motherboard
ioapic1 Version 2.0 irqs 30-53 on motherboard
kbd1 at kbdmux0
acpi0: INTEL S3000AHX on motherboard
em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 
0x3020-0x303f mem 0x8826-0x8827,0x8824-0x8825 irq 16 
at device 0.0 on pci1

em0: Ethernet address: 00:15:17:12:f6:04
em0: [FAST]
em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 
0x3000-0x301f mem 0x8822-0x8823,0x8820-0x8821 irq 17 
at device 0.1 on pci1

em1: Ethernet address: 00:15:17:12:f6:05
em1: [FAST]
em2: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 
0x2000-0x201f mem 0x8818-0x8819,0x8810-0x8817 irq 17 
at device 0.0 on pci5

em2: Ethernet address: 00:15:17:29:6f:ef
em2: [FAST]
em3: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 
0x1100-0x113f mem 0x8802-0x8803,0x8800-0x8801 irq 17 
at device 5.0 on pci6

em3: Ethernet address: 00:15:17:29:6f:f0
em3: [FAST]

I already ran the dos util to fix the eeprom, but no difference.


---Mike 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Goran Lowkrantz

[EMAIL PROTECTED] wrote:


Hi,


  snip



When running netstat between servers balder and midgard, server balder
get watchdog timeouts and resets the connection for a few seconds.
Oct 19 13:12:47 balder kernel: em0: watchdog timeout -- resetting


s/netstat/netperf/

... the future isMobile

 Goran Lowkrantz [EMAIL PROTECTED]
 System Architect, iaMobile AB
 Sandviksgatan 81, PO Box 58, S-971 03 LuleƄ, Sweden
 Mobile: +46(0)70-587 87 82
http://www.ismobile.com ...
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Jack Vogel
OK, I will look into this as soon as I can.

Jack


On 10/19/07, Philip Murray [EMAIL PROTECTED] wrote:

 On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote:

  Hi,
 
  After the update of em to 6.6.6 last, I experience watchdog timeouts
  on a server running 6-STABLE.
 

 me too on a Supermicro 5015MT+, although I notice my em0 is also
 sharing an interrupt with USB (uhci3)... not sure if that's the culprit.



 [EMAIL PROTECTED] ~]$ dmesg
 Copyright (c) 1992-2007 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
 The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 6.2-STABLE #0: Tue Oct  9 07:45:50 NZDT 2007
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC
 ACPI APIC Table: PTLTD  APIC  
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Xeon(R) CPU   X3220  @ 2.40GHz (2394.01-MHz 686-
 class CPU)
   Origin = GenuineIntel  Id = 0x6f7  Stepping = 7

 Features
 =
 0xbfebfbff
 
 FPU
 ,VME
 ,DE
 ,PSE
 ,TSC
 ,MSR
 ,PAE
 ,MCE
 ,CX8
 ,APIC
 ,SEP
 ,MTRR
 ,PGE
 ,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE

 Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
   AMD Features=0x2010NX,LM
   AMD Features2=0x1LAHF
   Cores per package: 4
 real memory  = 2146304000 (2046 MB)
 avail memory = 2095353856 (1998 MB)
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 ioapic1 Version 2.0 irqs 24-47 on motherboard
 kbd1 at kbdmux0
 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413,
 RF5413)
 acpi0: PTLTD   RSDT on motherboard
 acpi0: Power Button (fixed)
 Timecounter ACPI-fast frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
 cpu0: ACPI CPU on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pcib2: ACPI PCI-PCI bridge irq 17 at device 28.0 on pci0
 pci9: ACPI PCI bus on pcib2
 pcib3: ACPI PCI-PCI bridge at device 0.0 on pci9
 pci10: ACPI PCI bus on pcib3
 pcib4: PCI-PCI bridge at device 1.0 on pci10
 pci11: PCI bus on pcib4
 arcmsr0: Areca SATA Host Adapter RAID Controller
   mem 0xe020-0xe0200fff,0xe080-0xe0bf irq 26 at device
 14.0 on pci11
 ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05
 ARECA RAID ADAPTER0: FIRMWARE VERSION V1.42 2006-10-13
 pcib5: ACPI PCI-PCI bridge irq 17 at device 28.4 on pci0
 pci13: ACPI PCI bus on pcib5
 em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x4000-0x401f mem 0xe030-0xe031 irq 16 at device 0.0 on pci13
 em0: Ethernet address: 00:30:48:90:48:dc
 em0: [FAST]
 pcib6: ACPI PCI-PCI bridge irq 16 at device 28.5 on pci0
 pci14: ACPI PCI bus on pcib6
 em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
 0x5000-0x501f mem 0xe040-0xe041 irq 17 at device 0.0 on pci14
 em1: Ethernet address: 00:30:48:90:48:dd
 em1: [FAST]
 uhci0: UHCI (generic) USB controller port 0x3000-0x301f irq 23 at
 device 29.0 on pci0
 uhci0: [GIANT-LOCKED]
 usb0: UHCI (generic) USB controller on uhci0
 usb0: USB revision 1.0
 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
 uhub0: 2 ports with 2 removable, self powered
 uhci1: UHCI (generic) USB controller port 0x3020-0x303f irq 19 at
 device 29.1 on pci0
 uhci1: [GIANT-LOCKED]
 usb1: UHCI (generic) USB controller on uhci1
 usb1: USB revision 1.0
 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
 uhub1: 2 ports with 2 removable, self powered
 uhci2: UHCI (generic) USB controller port 0x3040-0x305f irq 18 at
 device 29.2 on pci0
 uhci2: [GIANT-LOCKED]
 usb2: UHCI (generic) USB controller on uhci2
 usb2: USB revision 1.0
 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
 uhub2: 2 ports with 2 removable, self powered
 uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at
 device 29.3 on pci0
 uhci3: [GIANT-LOCKED]
 usb3: UHCI (generic) USB controller on uhci3
 usb3: USB revision 1.0
 uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
 uhub3: 2 ports with 2 removable, self powered
 ehci0: Intel 82801GB/R (ICH7) USB 2.0 controller mem
 0xe000-0xe3ff irq 23 at device 29.7 on pci0
 ehci0: [GIANT-LOCKED]
 usb4: EHCI version 1.0
 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
 usb4: Intel 82801GB/R (ICH7) USB 2.0 controller on ehci0
 usb4: USB revision 2.0
 uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
 uhub4: 8 ports with 8 removable, self powered
 pcib7: ACPI PCI-PCI bridge at device 30.0 on pci0
 pci15: ACPI PCI bus on pcib7
 pci15: display, VGA at device 0.0 (no driver attached)
 isab0: PCI-ISA bridge at device 31.0 on pci0
 isa0: ISA bus on isab0
 atapci0: Intel ICH7 UDMA100 controller port
 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x30a0-0x30af at device 31.1 on pci0
 ata0: ATA channel 0 on atapci0
 ata1: ATA channel 1 

Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Philip Murray


On 20/10/2007, at 5:03 PM, Jeremy Chadwick wrote:


On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote:
me too on a Supermicro 5015MT+, although I notice my em0 is also  
sharing

an interrupt with USB (uhci3)... not sure if that's the culprit.


I'm not aware of a 5015MT+ model.  Maybe you mean 5015M-MT+ or
5015M-T+?

We have two 5015M-T+ boxes, one running RELENG_7 and one running
RELENG_6, and neither exhibit this problem.  Here's relevant em(4)
information from both boxes; I do find the vmstat -i IRQ on the 2nd
box a little odd, but operationally it seems fine.

box1 (RELENG_6)
===
em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port  
0x4000-0x401f mem 0xe020-0xe021 irq 16 at device 0.0 on pci13
em1: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port  
0x5000-0x501f mem 0xe030-0xe031 irq 17 at device 0.0 on pci14


irq16: em0 uhci3   117371806 11
irq17: em1  49605005  4


box2 (RELENG_7)
===
em0: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port  
0x4000-0x401f mem 0xe800-0xe801 irq 16 at device 0.0 on pci13
em1: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port  
0x5000-0x501f mem 0xe820-0xe821 irq 17 at device 0.0 on pci14


irq256: em0   502854  4
irq257: em1 5438  0


On box2, IRQ 16 is also shared with uhci3 and vgapci0, but vmstat
doesn't seem to show that.

uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at  
device 29.3 on pci0
vgapci0: VGA-compatible display port 0x6000-0x60ff mem  
0xe000-0xe7ff,0xe830-0xe830 irq 16 at device 0.0 on  
pci15




Hi Jeremy

You don't seem to be running the 6.6.6 driver, which is when the  
watchdog timeouts started occurring. Had no problems with 6.2.9.


Cheers

Phil

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Jeremy Chadwick
On Sat, Oct 20, 2007 at 11:21:10AM +1300, Philip Murray wrote:
 me too on a Supermicro 5015MT+, although I notice my em0 is also sharing 
 an interrupt with USB (uhci3)... not sure if that's the culprit.

I'm not aware of a 5015MT+ model.  Maybe you mean 5015M-MT+ or
5015M-T+?

We have two 5015M-T+ boxes, one running RELENG_7 and one running
RELENG_6, and neither exhibit this problem.  Here's relevant em(4)
information from both boxes; I do find the vmstat -i IRQ on the 2nd
box a little odd, but operationally it seems fine.

box1 (RELENG_6)
===
em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x4000-0x401f 
mem 0xe020-0xe021 irq 16 at device 0.0 on pci13
em1: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 0x5000-0x501f 
mem 0xe030-0xe031 irq 17 at device 0.0 on pci14

irq16: em0 uhci3   117371806 11
irq17: em1  49605005  4


box2 (RELENG_7)
===
em0: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x4000-0x401f 
mem 0xe800-0xe801 irq 16 at device 0.0 on pci13
em1: Intel(R) PRO/1000 Network Connection Version - 6.5.3 port 0x5000-0x501f 
mem 0xe820-0xe821 irq 17 at device 0.0 on pci14

irq256: em0   502854  4
irq257: em1 5438  0


On box2, IRQ 16 is also shared with uhci3 and vgapci0, but vmstat
doesn't seem to show that.

uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at device 29.3 
on pci0
vgapci0: VGA-compatible display port 0x6000-0x60ff mem 
0xe000-0xe7ff,0xe830-0xe830 irq 16 at device 0.0 on pci15

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Philip Murray


On 20/10/2007, at 1:06 AM, Goran Lowkrantz wrote:


Hi,

After the update of em to 6.6.6 last, I experience watchdog timeouts  
on a server running 6-STABLE.




me too on a Supermicro 5015MT+, although I notice my em0 is also  
sharing an interrupt with USB (uhci3)... not sure if that's the culprit.




[EMAIL PROTECTED] ~]$ dmesg
Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-STABLE #0: Tue Oct  9 07:45:50 NZDT 2007
   [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC
ACPI APIC Table: PTLTD   APIC  
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(R) CPU   X3220  @ 2.40GHz (2394.01-MHz 686- 
class CPU)

 Origin = GenuineIntel  Id = 0x6f7  Stepping = 7
  
Features 
= 
0xbfebfbff 
 
FPU 
,VME 
,DE 
,PSE 
,TSC 
,MSR 
,PAE 
,MCE 
,CX8 
,APIC 
,SEP 
,MTRR 
,PGE 
,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0xe3bdSSE3,RSVD2,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM

 AMD Features=0x2010NX,LM
 AMD Features2=0x1LAHF
 Cores per package: 4
real memory  = 2146304000 (2046 MB)
avail memory = 2095353856 (1998 MB)
ioapic0 Version 2.0 irqs 0-23 on motherboard
ioapic1 Version 2.0 irqs 24-47 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413,  
RF5413)

acpi0: PTLTD   RSDT on motherboard
acpi0: Power Button (fixed)
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
cpu0: ACPI CPU on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0
pci1: ACPI PCI bus on pcib1
pcib2: ACPI PCI-PCI bridge irq 17 at device 28.0 on pci0
pci9: ACPI PCI bus on pcib2
pcib3: ACPI PCI-PCI bridge at device 0.0 on pci9
pci10: ACPI PCI bus on pcib3
pcib4: PCI-PCI bridge at device 1.0 on pci10
pci11: PCI bus on pcib4
arcmsr0: Areca SATA Host Adapter RAID Controller
 mem 0xe020-0xe0200fff,0xe080-0xe0bf irq 26 at device  
14.0 on pci11

ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05
ARECA RAID ADAPTER0: FIRMWARE VERSION V1.42 2006-10-13
pcib5: ACPI PCI-PCI bridge irq 17 at device 28.4 on pci0
pci13: ACPI PCI bus on pcib5
em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port  
0x4000-0x401f mem 0xe030-0xe031 irq 16 at device 0.0 on pci13

em0: Ethernet address: 00:30:48:90:48:dc
em0: [FAST]
pcib6: ACPI PCI-PCI bridge irq 16 at device 28.5 on pci0
pci14: ACPI PCI bus on pcib6
em1: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port  
0x5000-0x501f mem 0xe040-0xe041 irq 17 at device 0.0 on pci14

em1: Ethernet address: 00:30:48:90:48:dd
em1: [FAST]
uhci0: UHCI (generic) USB controller port 0x3000-0x301f irq 23 at  
device 29.0 on pci0

uhci0: [GIANT-LOCKED]
usb0: UHCI (generic) USB controller on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: UHCI (generic) USB controller port 0x3020-0x303f irq 19 at  
device 29.1 on pci0

uhci1: [GIANT-LOCKED]
usb1: UHCI (generic) USB controller on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: UHCI (generic) USB controller port 0x3040-0x305f irq 18 at  
device 29.2 on pci0

uhci2: [GIANT-LOCKED]
usb2: UHCI (generic) USB controller on uhci2
usb2: USB revision 1.0
uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3: UHCI (generic) USB controller port 0x3060-0x307f irq 16 at  
device 29.3 on pci0

uhci3: [GIANT-LOCKED]
usb3: UHCI (generic) USB controller on uhci3
usb3: USB revision 1.0
uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
ehci0: Intel 82801GB/R (ICH7) USB 2.0 controller mem  
0xe000-0xe3ff irq 23 at device 29.7 on pci0

ehci0: [GIANT-LOCKED]
usb4: EHCI version 1.0
usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
usb4: Intel 82801GB/R (ICH7) USB 2.0 controller on ehci0
usb4: USB revision 2.0
uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
pcib7: ACPI PCI-PCI bridge at device 30.0 on pci0
pci15: ACPI PCI bus on pcib7
pci15: display, VGA at device 0.0 (no driver attached)
isab0: PCI-ISA bridge at device 31.0 on pci0
isa0: ISA bus on isab0
atapci0: Intel ICH7 UDMA100 controller port  
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x30a0-0x30af at device 31.1 on pci0

ata0: ATA channel 0 on atapci0
ata1: ATA channel 1 on atapci0
pci0: serial bus, SMBus at device 31.3 (no driver attached)
acpi_button0: Power Button on acpi0
sio0: 16550A-compatible COM port port 0x3f8-0x3ff irq 4 flags 0x10  
on 

em 6.6.6 - watchdog timeout

2007-10-19 Thread Goran Lowkrantz

Hi,

After the update of em to 6.6.6 last, I experience watchdog timeouts on a 
server running 6-STABLE.


I have two identical servers with Intel D915GAV boards. Both have Intel 
PRO/1000 PCI-Express network cards.


Server balder:
em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port 
0xac00-0xac1f mem 0xff60-0xff61,0xff62-0xff63 irq 16 at 
device 0.0 on pci5

em0: Ethernet address: 00:1b:21:00:48:c4
em0: [FAST]

# vmstat -i
interrupt  total   rate
irq1: atkbd0   3  0
irq4: sio0 2  0
irq6: fdc012  0
irq14: ata0   68  0
irq16: em0 uhci3   219828879450
irq19: uhci1++   4287947  8
irq22: ahc0232717293476
irq23: uhci0 ehci0 1  0
cpu0: timer976552804   2000
Total 1433387009   2935

# netstat -i
NameMtu Network   Address  Ipkts IerrsOpkts Oerrs 
Coll
em01500 Link#1  00:1b:21:00:48:c4 209880531   773 2062284 
0
em01500 10.255.253/24 balder215210996 - 212337968 - 
-
plip0  1500 Link#2   0 00 0 
0
lo0   16384 Link#312040055 0 12055326 0 
0
lo0   16384 fe80:3::1 fe80:3::10 -0 - 
-
lo0   16384 localhost ::1  6 -6 - 
-
lo0   16384 your-net  localhost  6249979 -  6249980 - 
-


00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory 
Controller Hub (rev 04)
00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express 
Root Port (rev 04)
00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL 
Integrated Graphics Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) 
PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) 
PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) 
PCI Express Port 3 (rev 03)
00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) 
PCI Express Port 4 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) USB UHCI #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) USB UHCI #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) USB UHCI #3 (rev 03)
00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) USB UHCI #4 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) USB2 EHCI Controller (rev 03)

00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3)
00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface 
Bridge (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 
Family) IDE Controller (rev 03)
00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA 
Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus 
Controller (rev 03)
05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet 
Controller (Copper) (rev 06)

06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01)


Server midgard:
em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port 
0xac00-0xac1f mem 0xff50-0xff51,0xff52-0xff53 irq 16 at 
device 0.0 on pci5

em0: Ethernet address: 00:15:17:0e:05:f7
[EMAIL PROTECTED] vmstat -i
interrupt  total   rate
irq1: atkbd0  11  0
irq4: sio0   2142746  0
irq6: fdc014  0
irq14: ata0  252  0
irq16: em0+40101164
irq19: atapci1+  7932757  1
irq22: ahc0 87074425 21
cpu0: timer   3807810138937
Total 4571600444   1125

[EMAIL PROTECTED] netstat -i
NameMtu Network   Address  Ipkts IerrsOpkts Oerrs 
Coll
em01500 Link#1  00:15:17:0e:05:f7 343771280 0 474609731 0 
0
em01500 10.255.253/24 midgard   347467842 - 478700485 - 
-
plip0  1500 Link#2   0 00 0 
0
lo0   16384 Link#316821054 0 16947668 0 
0
lo0   16384 fe80:3::1 fe80:3::10 -0 - 
-
lo0   16384 localhost ::1   2610 - 2610 - 
-
lo0   16384 your-net  localhost 12616879 - 12616879 - 
-
lo0   16384 10.255.253.12 appsrv1  0 -0 - 
-

Re: em 6.6.6 - watchdog timeout

2007-10-19 Thread Goran Lowkrantz

[EMAIL PROTECTED] wrote:


Hi,

After the update of em to 6.6.6 last, I experience watchdog timeouts on a
server running 6-STABLE.

I have two identical servers with Intel D915GAV boards. Both have Intel
PRO/1000 PCI-Express network cards.

Server balder:
em0: Intel(R) PRO/1000 Network Connection Version - 6.6.6 port
0xac00-0xac1f mem 0xff60-0xff61,0xff62-0xff63 irq 16 at
device 0.0 on pci5
em0: Ethernet address: 00:1b:21:00:48:c4
em0: [FAST]

# vmstat -i
interrupt  total   rate
irq1: atkbd0   3  0
irq4: sio0 2  0
irq6: fdc012  0
irq14: ata0   68  0
irq16: em0 uhci3   219828879450
irq19: uhci1++   4287947  8
irq22: ahc0232717293476
irq23: uhci0 ehci0 1  0
cpu0: timer976552804   2000
Total 1433387009   2935

# netstat -i
NameMtu Network   Address  Ipkts IerrsOpkts Oerrs
Coll
em01500 Link#1  00:1b:21:00:48:c4 209880531   773 20622
84 0
em01500 10.255.253/24 balder215210996 - 212337968
- -
plip0  1500 Link#2   0 00 0
0
lo0   16384 Link#312040055 0 12055326 0
0
lo0   16384 fe80:3::1 fe80:3::10 -0 -
-
lo0   16384 localhost ::1  6 -6 -
-
lo0   16384 your-net  localhost  6249979 -  6249980 -
-

00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory
Controller Hub (rev 04)
00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express
Root Port (rev 04)
00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL
Integrated Graphics Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family)
PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family)
PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family)
PCI Express Port 3 (rev 03)
00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family)
PCI Express Port 4 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) USB UHCI #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) USB UHCI #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) USB UHCI #3 (rev 03)
00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) USB UHCI #4 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) USB2 EHCI Controller (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3)
00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC
Interface Bridge (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6
Family) IDE Controller (rev 03)
00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA
Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family)
SMBus Controller (rev 03)
05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet
Controller (Copper) (rev 06)
06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev
01)


Server midgard:
em0: Intel(R) PRO/1000 Network Connection Version - 6.2.9 port
0xac00-0xac1f mem 0xff50-0xff51,0xff52-0xff53 irq 16 at
device 0.0 on pci5
em0: Ethernet address: 00:15:17:0e:05:f7
[EMAIL PROTECTED] vmstat -i
interrupt  total   rate
irq1: atkbd0  11  0
irq4: sio0   2142746  0
irq6: fdc014  0
irq14: ata0  252  0
irq16: em0+40101164
irq19: atapci1+  7932757  1
irq22: ahc0 87074425 21
cpu0: timer   3807810138937
Total 4571600444   1125

[EMAIL PROTECTED] netstat -i
NameMtu Network   Address  Ipkts IerrsOpkts Oerrs
Coll
em01500 Link#1  00:15:17:0e:05:f7 343771280 0 474609731
0 0
em01500 10.255.253/24 midgard   347467842 - 478700485
- -
plip0  1500 Link#2   0 00 0
0
lo0   16384 Link#316821054 0 16947668 0
0
lo0   16384 fe80:3::1 fe80:3::10 -0 -
-
lo0   16384 localhost ::1   2610 - 2610 -
-
lo0   16384 your-net  localhost 12616879 - 12616879 -
-
lo0   16384 10.255.253.12 appsrv1  0 -0 -
-
lo0   16384 10.255.253.10