Re: Supermicro Bladeserver

2011-01-11 Thread Robin Sommer

On Wed, Jan 12, 2011 at 03:13 -, you wrote:

 Out of interest what change was that?

As what seems to have been a left-over from a debugging session a
long time ago, I had MSI disabled in loader.conf. That's not
supported by the driver. So simply reenabling that solved my
problem.

Robin

-- 
Robin Sommer * Phone +1 (510) 722-6541 * ro...@icir.org
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Re: igb watchdog timeouts

2011-01-03 Thread Robin Sommer
Hello all,

quite a while ago I asked about the problem below. Unfortunately, I
haven't found a solution yet and I'm actually still seeing these
timeouts after just upgrading to 8.2-RC1. Any further ideas on what
could be triggering them, or how I could track down the cause?

Thanks,

Robin

On Thu, Jul 29, 2010 at 14:56 -0700, I wrote:

 Since upgrading from 8.0 to 8.1-RELEASE, I'm seeing lots of messages
 like those below on all my SuperMicro SBI-7425C-T3 blades. There's
 almost no traffic on those interfaces. 
 
 Any idea?
 
 Thanks,
 
 Robin
 
 Jul 29 13:01:18 blade0 kernel: igb1: Watchdog timeout -- resetting
 Jul 29 13:01:18 blade0 kernel: igb1: Queue(0) tdh = 256, hw tdt = 266
 Jul 29 13:01:18 blade0 kernel: igb1: TX(0) desc avail = 1013,Next TX to Clean 
 = 255
 Jul 29 13:01:18 blade0 kernel: igb1: link state changed to DOWN
 Jul 29 13:01:18 blade0 kernel: igb1: link state changed to UP
 Jul 29 13:01:29 blade0 kernel: igb1: Watchdog timeout -- resetting
 Jul 29 13:01:29 blade0 kernel: igb1: Queue(0) tdh = 0, hw tdt = 10
 Jul 29 13:01:29 blade0 kernel: igb1: TX(0) desc avail = 1014,Next TX to Clean 
 = 0
 Jul 29 13:01:29 blade0 kernel: igb1: link state changed to DOWN
 Jul 29 13:01:29 blade0 kernel: igb1: link state changed to UP
 Jul 29 13:01:46 blade0 kernel: igb1: Watchdog timeout -- resetting
 Jul 29 13:01:46 blade0 kernel: igb1: Queue(0) tdh = 32, hw tdt = 33
 Jul 29 13:01:46 blade0 kernel: igb1: TX(0) desc avail = 1022,Next TX to Clean 
 = 31
 Jul 29 13:01:46 blade0 kernel: igb1: link state changed to DOWN
 Jul 29 13:01:46 blade0 kernel: igb1: link state changed to UP
 Jul 29 13:01:57 blade0 kernel: igb1: Watchdog timeout -- resetting
 Jul 29 13:01:57 blade0 kernel: igb1: Queue(0) tdh = 0, hw tdt = 10
 Jul 29 13:01:57 blade0 kernel: igb1: TX(0) desc avail = 1014,Next TX to Clean 
 = 0
 Jul 29 13:01:57 blade0 kernel: igb1: link state changed to DOWN
 Jul 29 13:01:58 blade0 kernel: igb1: link state changed to UP
 Jul 29 13:02:13 blade0 kernel: igb1: Watchdog timeout -- resetting
 
  grep igb /var/run/dmesg.boot
 igb0: Intel(R) PRO/1000 Network Connection version - 1.9.5 port 
 0x2000-0x201f mem 
 0xfc94-0xfc95,0xfc92-0xfc93,0xfc90-0xfc903fff irq 16 at 
 device 0.0 on pci4
 igb0: [FILTER]
 igb0: Ethernet address: 00:30:48:9e:22:00
 igb1: Intel(R) PRO/1000 Network Connection version - 1.9.5 port 
 0x2020-0x203f mem 
 0xfc98-0xfc99,0xfc96-0xfc97,0xfc904000-0xfc907fff irq 17 at 
 device 0.1 on pci4
 igb1: [FILTER]
 igb1: Ethernet address: 00:30:48:9e:22:01
 
  pciconf -lv 
 [...]
 i...@pci0:4:0:0: class=0x02 card=0x10a915d9
 chip=0x10a98086 rev=0x02 hdr=0x00
 vendor = 'Intel Corporation'
 device = '82575EB Gigabit Backplane Connection'
 class  = network
 subclass   = ethernet
 i...@pci0:4:0:1:class=0x02 card=0x10a915d9
 chip=0x10a98086 rev=0x02 hdr=0x00
 vendor = 'Intel Corporation'
 device = '82575EB Gigabit Backplane Connection'
 class  = network
 subclass   = ethernet
 [...]

-- 
Robin Sommer * Phone +1 (510) 722-6541 * ro...@icir.org
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


igb watchdog timeouts

2010-07-29 Thread Robin Sommer
Since upgrading from 8.0 to 8.1-RELEASE, I'm seeing lots of messages
like those below on all my SuperMicro SBI-7425C-T3 blades. There's
almost no traffic on those interfaces. 

Any idea?

Thanks,

Robin

Jul 29 13:01:18 blade0 kernel: igb1: Watchdog timeout -- resetting
Jul 29 13:01:18 blade0 kernel: igb1: Queue(0) tdh = 256, hw tdt = 266
Jul 29 13:01:18 blade0 kernel: igb1: TX(0) desc avail = 1013,Next TX to Clean = 
255
Jul 29 13:01:18 blade0 kernel: igb1: link state changed to DOWN
Jul 29 13:01:18 blade0 kernel: igb1: link state changed to UP
Jul 29 13:01:29 blade0 kernel: igb1: Watchdog timeout -- resetting
Jul 29 13:01:29 blade0 kernel: igb1: Queue(0) tdh = 0, hw tdt = 10
Jul 29 13:01:29 blade0 kernel: igb1: TX(0) desc avail = 1014,Next TX to Clean = 0
Jul 29 13:01:29 blade0 kernel: igb1: link state changed to DOWN
Jul 29 13:01:29 blade0 kernel: igb1: link state changed to UP
Jul 29 13:01:46 blade0 kernel: igb1: Watchdog timeout -- resetting
Jul 29 13:01:46 blade0 kernel: igb1: Queue(0) tdh = 32, hw tdt = 33
Jul 29 13:01:46 blade0 kernel: igb1: TX(0) desc avail = 1022,Next TX to Clean = 
31
Jul 29 13:01:46 blade0 kernel: igb1: link state changed to DOWN
Jul 29 13:01:46 blade0 kernel: igb1: link state changed to UP
Jul 29 13:01:57 blade0 kernel: igb1: Watchdog timeout -- resetting
Jul 29 13:01:57 blade0 kernel: igb1: Queue(0) tdh = 0, hw tdt = 10
Jul 29 13:01:57 blade0 kernel: igb1: TX(0) desc avail = 1014,Next TX to Clean = 0
Jul 29 13:01:57 blade0 kernel: igb1: link state changed to DOWN
Jul 29 13:01:58 blade0 kernel: igb1: link state changed to UP
Jul 29 13:02:13 blade0 kernel: igb1: Watchdog timeout -- resetting

 grep igb /var/run/dmesg.boot
igb0: Intel(R) PRO/1000 Network Connection version - 1.9.5 port 0x2000-0x201f 
mem 0xfc94-0xfc95,0xfc92-0xfc93,0xfc90-0xfc903fff irq 16 at 
device 0.0 on pci4
igb0: [FILTER]
igb0: Ethernet address: 00:30:48:9e:22:00
igb1: Intel(R) PRO/1000 Network Connection version - 1.9.5 port 0x2020-0x203f 
mem 0xfc98-0xfc99,0xfc96-0xfc97,0xfc904000-0xfc907fff irq 17 at 
device 0.1 on pci4
igb1: [FILTER]
igb1: Ethernet address: 00:30:48:9e:22:01

 pciconf -lv 
[...]
i...@pci0:4:0:0: class=0x02 card=0x10a915d9
chip=0x10a98086 rev=0x02 hdr=0x00
vendor = 'Intel Corporation'
device = '82575EB Gigabit Backplane Connection'
class  = network
subclass   = ethernet
i...@pci0:4:0:1:class=0x02 card=0x10a915d9
chip=0x10a98086 rev=0x02 hdr=0x00
vendor = 'Intel Corporation'
device = '82575EB Gigabit Backplane Connection'
class  = network
subclass   = ethernet
[...]


-- 
Robin Sommer * Phone +1 (510) 666-2886 * ro...@icir.org 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Re: Crashes with ixgbe on FreeBSD 8

2010-05-26 Thread Robin Sommer

On Tue, May 25, 2010 at 12:03 -0700, I wrote:

 /usr/src/sys/dev/ixgbe/ixgbe.c:833: warning: implicit declaration of function 
 'drbr_needs_enqueue'

So, I've now simply copied that function over and, indeed, that not
only compiles but gives me a working 2.2.0 driver, enabling me to
capture traffic in a stable fashion as far as I can tell so far ...

My question: is simply pasting this function into RELEASE a
reasonable fix, or am I missing out on further changes and am thus
getting myself into trouble in some form?

Robin

-- 
Robin Sommer * Phone +1 (510) 666-2886 * ro...@icir.org 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Re: Crashes with ixgbe on FreeBSD 8

2010-05-25 Thread Robin Sommer

On Wed, May 19, 2010 at 09:47 -0700, you wrote:

 issue this morning but I will try and check the change into HEAD
 this afternoon then you will able to just pull that and the driver
 will drop into 8 REL with no problem.

I'm trying to compile 2.2.0 with RELEASE, but I get the following
error:

/usr/src/sys/dev/ixgbe/ixgbe.c: In function 'ixgbe_mq_start_locked':
/usr/src/sys/dev/ixgbe/ixgbe.c:833: warning: implicit declaration of function 
'drbr_needs_enqueue'
/usr/src/sys/dev/ixgbe/ixgbe.c:833: warning: nested extern declaration of 
'drbr_needs_enqueue'

Is there an easy way to get around this?

Thanks,

Robin

-- 
Robin Sommer * Phone +1 (510) 666-2886 * ro...@icir.org 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Re: Crashes with ixgbe on FreeBSD 8

2010-05-19 Thread Robin Sommer

On Tue, May 18, 2010 at 22:28 -0700, you wrote:

 Yes, download the latest code, oh hmmm, can  you install STABLE,
 because there's one small issue that will cause a problem on 8 REL,
 but if that's a big problem I can also give you a patch so it will work.

Jack, thanks for the reply. I could upgrade to STABLE but I as I
have quite a number of machines in my setup (and not not all of them
need the driver) I would prefer to stay with RELEASE if that is an
option. So if you could send me a patch, that would be very helpful.

 What I'd like to see you try is the newest code that I checked into
 HEAD today, that's what I am planning for 8.1 and I want as much
 testing as I can get anyway.

Sure, I'd be happy to try that. Can this be back-ported to RELEASE
as well or does it need HEAD?

Thanks!

Robin

-- 
Robin Sommer * Phone +1 (510) 666-2886 * ro...@icir.org 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Crashes with ixgbe on FreeBSD 8

2010-05-18 Thread Robin Sommer
Hi all,

I'm running into crashes with the ixgbe driver on a FreeBSD
8.0-RELEASE system equipped with an Intel X520-SR2 (and using a
Supermicro X8DTU-F motherboard). I was wondering whether somebody
might have seen this before.

I updated the driver to the latest version I found on Intel's
download page, which is 2.0.1. However, the moment I run tcpdump on
my ix1 interface (which is connected to a switch's monitor port), I
see almost immediate crashes of the kind below. Sometimes tcpdump
outputs a few 10s of packets before the system crashes, but most of
the time it doesn't even get that far.

I saw that 8.0-STABLE comes with a more recent version of the
driver. Any chance that that might fix the problem? Any other
thoughts?

Thanks,

Robin

- cut ---

syslog from crash:

May 14 16:42:24 bart kernel: ix1: link state changed to UP  
 
May 14 16:43:34 bart kernel: ix1: promiscuous mode enabled  
 
May 14 16:43:35 bart kernel:
 
May 14 16:43:35 bart kernel:
 
May 14 16:43:35 bart kernel: Fatal trap 12: page fault while in kernel mode 
 
May 14 16:43:35 bart kernel: cpuid = 5; apic id = 05
 
May 14 16:43:35 bart kernel: fault virtual address  = 0x80403d38d3b8
 
May 14 16:43:35 bart kernel: fault code = supervisor read data, page 
not present 
May 14 16:43:35 bart kernel: instruction pointer= 
0x20:0x808437fe
May 14 16:43:35 bart kernel:
 
May 14 16:43:35 bart kernel: stack pointer  = 
0x28:0xff82b34d5970
May 14 16:43:35 bart kernel: frame pointer  = 
0x28:0xff82b34d5980
May 14 16:43:35 bart kernel: code segment   = base 0x0, limit 
0xf, type 0x1b 
May 14 16:43:35 bart kernel: = DPL 0, pres 1, long 1, def32 0, gran 1   
 
May 14 16:43:35 bart kernel: processor eflags   = interrupt enabled,


Driver initalization:

ix0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.0.1 port 
0xec00-0xec1f mem 0xf8f8-0xf8ff,0xf8f7c006
ix0: Using MSIX interrupts with 33 vectors
ix0: [ITHREAD]
[...]
ix0: Ethernet address: 00:1b:21:55:7f:d8
ix0: PCI Express Bus: Speed 5.0Gb/s Width x8
ix1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.0.1 port 
0xe880-0xe89f mem 0xf8e8-0xf8ef,0xf8e7c006
ix1: Using MSIX interrupts with 33 vectors
ix1: [ITHREAD]
[...]
ix1: Ethernet address: 00:1b:21:55:7f:d9
ix1: PCI Express Bus: Speed 5.0Gb/s Width x8

ifconfig ix1

ix1: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=5bbRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,LRO
ether 00:1b:21:55:7f:d9
media: Ethernet autoselect (10Gbase-SR full-duplex)
status: active



-- 
Robin Sommer * Phone +1 (510) 666-2886 * ro...@icir.org 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to freebsd-net-unsubscr...@freebsd.org


Re: BPF problems on FreeBSD 7.0

2008-07-15 Thread Robin Sommer

On Mon, Jul 14, 2008 at 14:44 +0100, Bruce M. Simpson wrote:

 One place to start might be: netstat -B output in 7.x (I *think* this got 
 MFCed), this will let us see what the drop count is for the Bro process, 
 and what the flags are for the open BPF descriptors in the system.

Thanks for the suggestion. Here's the netstat -B output at the time
it has stalled (after about 6 hours of working normally):

   Pid  Netif   Flags  Recv  Drop Match Sblen Hblen Command
14557  nxge0 p--s--- 2162189525  32514465  42815457 4194248 4194258 bro

Top shows:

  PID USERNAME  THR PRI NICE   SIZERES STATE  C   TIME   WCPU COMMAND
14557 bro1 -580272M   267M5  25:53  0.00% bro



A few minutes after starting the process, when Bro was still working
fine, a netstat -B output was:

# netstat -B
  Pid  Netif   Flags  Recv  Drop Match Sblen Hblen Command
14557  nxge0 p--s---   4779235 0 94967 0 0 bro

Thanks,

Robin

-- 
Robin Sommer * Phone +1 (510) 666-2886 * [EMAIL PROTECTED] 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: BPF problems on FreeBSD 7.0

2008-07-15 Thread Robin Sommer

On Tue, Jul 15, 2008 at 14:25 -0700, you wrote:

 Thanks for the suggestion. Here's the netstat -B output at the time
 it has stalled (after about 6 hours of working normally):
[...]
 at your rate of receiving packets, it passed that value about
 2 minutes before this snapshot was taken..

Sorry, I wasn't precise: the process stalled after about 6 hours but
the netstat output is actually from much later (the next day in
fact, because it stalled latet a night) when it was still in that
state.  

Robin

-- 
Robin Sommer * Phone +1 (510) 666-2886 * [EMAIL PROTECTED] 
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to [EMAIL PROTECTED]


BPF problems on FreeBSD 7.0

2008-07-11 Thread Robin Sommer
Hi all,

we're seeing some strange effects with our libpcap-based application
(the Bro network intrusion detection system) on a FreeBSD 7-RELEASE
system. As the application has always been running fine on 6.x,
we're wondering whether this might be triggered by any of the
changes that went into 7.

The problem is that the Bro process, after running fine for a few
hours or so, regularly stalls completely; the process seems to enter
some odd state, using 0% CPU and with top showing only an empty
field in the STATE column.

We saw this effect with a Neterion network card and first thought it
might be a driver problem. After switching to an Intel card, we see
something slightly different: now the process doesn't stall
completely anymore but it still gets to some point at which it stops
receiving packets from libpcap.

We haven't yet seen these problems with any other libpcap
application. The only difference between Bro and most other libpcap
applications that I can think of right now, is that Bro is using
select() on the file descriptor. However, with a small test
applicaton which mimics Bro's way of using libpcap, we couldn't
reproduce the problem so far either.

With the Neterion card, we have also tried disabling LRO and MSI
explicitly but to no avail.

Again, this is all with a Bro installation that works fine when
running FreeBSD 6.x (we haven't run 6.x on the same boxes but we see
the problems on two separate machines running FreeBSD 7).

I'm wondering whether anybody here has seen something similar or
might have an idea where to start looking for the cause. Any ideas?

Thanks,

Robin

--
Robin Sommer * Phone +1 (510) 666-2886 * [EMAIL PROTECTED]
ICSI/LBNL* Fax   +1 (510) 666-2956 *   www.icir.org
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to [EMAIL PROTECTED]