Re: increase softint_bytes

2017-11-24 Thread 6bone

On Wed, 22 Nov 2017, Masanobu SAITOH wrote:


Date: Wed, 22 Nov 2017 11:41:59 +0900
From: Masanobu SAITOH 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: msai...@execsw.org, tech-kern@NetBSD.org
Subject: Re: increase softint_bytes

On 2017/11/21 15:35, Masanobu SAITOH wrote:

On 2017/11/20 17:28, Masanobu SAITOH wrote:

On 2017/11/17 18:42, 6b...@6bone.informatik.uni-leipzig.de wrote:

On Thu, 16 Nov 2017, Masanobu SAITOH wrote:


Hi, all.

Some device drivers now allocate a lot of softints.
See:

http://mail-index.netbsd.org/current-users/2017/11/09/msg032581.html

To avoid this panic, I wrote the following patch:

http://www.netbsd.org/~msaitoh/softint-20171116-0.dif



I tested the patch. Now the dump comes in another place.

https://suse.uni-leipzig.de/crash/crash-with-patch.jpg

Regards
Uwe


Could you test the following patch?

 http://www.netbsd.org/~msaitoh/vlan-20171120-0.dif


Updated patch

 http://www.netbsd.org/~msaitoh/vlan-20171121-0.dif

 Fix compile error (sorry)

 Revert if_wmreg.h 1.104 and if_wm.c 1.542


Committed in -current.


-current now boots without error on my server.

Regards
Uwe


Re: increase softint_bytes

2017-11-17 Thread 6bone

On Thu, 16 Nov 2017, Masanobu SAITOH wrote:


Hi, all.

Some device drivers now allocate a lot of softints.
See:

http://mail-index.netbsd.org/current-users/2017/11/09/msg032581.html

To avoid this panic, I wrote the following patch:

http://www.netbsd.org/~msaitoh/softint-20171116-0.dif



I tested the patch. Now the dump comes in another place.

https://suse.uni-leipzig.de/crash/crash-with-patch.jpg

Regards
Uwe


Re: PR/49328 CVS commit: src/sys/dev/pci/ixgbe

2015-02-17 Thread 6bone

On Wed, 4 Feb 2015, Christos Zoulas wrote:
...

christos


I have tested NetBSD 7.99.x

The problem is now solved. The PR can be closed.


Regards
Uwe


frozen netbsd-6-rc4

2013-05-12 Thread 6bone

hello,

I am using netbsd-6-rc4 at a dell 1850. the server works only as router. 
it is connected with an lacp port-channel (4*1gb) to a cisco catalyst 
switch.


sometimes (1/day or more often) the system cpu usage increases to 300 
percent or more. in some cases netbsd is frozen (also the kernel 
debugger), in other cases netbsd continues to a normal work after some 
minutes.


The top command shows the following:

load averages:  0.76,  0.77,  0.58;   up 4+03:49:27   11:31:11
33 processes: 31 sleeping, 2 on CPU
CPU0 states:  6.2% user,  0.0% nice, 18.8% system, 11.2% interrupt, 63.9% idle
CPU1 states:  5.0% user,  0.0% nice, 12.8% system,  3.6% interrupt, 78.6% idle
Memory: 279M Act, 153M Inact, 6036K Wired, 20M Exec, 152M File, 296M Free
Swap: 4506M Total, 4506M Free

  PID USERNAME PRI NICE   SIZE   RES STATE  TIME   WCPUCPU COMMAND
0 root   00 0K 6928K CPU/1 13:37  0.00%   277% [system]
  464 root  430   316M  238M parked/0  21.4H 32.47% 32.47% named
  .


Any idea what the problem could be? The hardware is ok and the switch 
does not show any abnormal network traffic.



Thank you for your efforts

Regards
Uwe


Re: kernel crash at ibm x3850

2012-01-16 Thread 6bone

Hello,

I compiled a kernel with some more debug code.


kernel text is mapped with 6 large pages and 34 normal pages
Loaded initial symtab at 0x81258fa0, strtab at 0x81303f70, 
# entries 29097

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
2006, 2007, 2008, 2009, 2010
The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California.  All rights reserved.

NetBSD 5.1_STABLE (INSTALL) #7: Sun Jan 15 23:33:41 CET 2012

r...@6bone.informatik.uni-leipzig.de:/usr/obj/sys/arch/amd64/compile/INSTALL
total memory = 511 GB
avail memory = 496 GB
RTC BIOS diagnostic error 80
SMBIOS rev. 2.7 @ 0x7f0be000 (137 entries)
IBM System x3850 X5 -[71453RG]- (06)
mainbus0 (root)
cpu0 at mainbus0 apid 0: Intel 686-class, 1995MHz, id 0x206e6
fatal protection fault in supervisor mode
trap type 4 code 0 rip 8056e456 cs 8 rflags 10246 cr2  0 cpl 8 rsp 
8137ab98 
kernel: protection fault trap, code=0 
Stopped in pid 0.1 (system) at  netbsd:rdmsr+0x6:   rdmsr

db{0}> bt
rdmsr() at netbsd:rdmsr+0x6
est_init_once() at netbsd:est_init_once+0x148
_run_once() at netbsd:_run_once+0x67
cpu_identify() at netbsd:cpu_identify+0x171
cpu_attach() at netbsd:cpu_attach+0x21f
config_attach_loc() at netbsd:config_attach_loc+0x15a
mpacpi_config_cpu() at netbsd:mpacpi_config_cpu+0x6e
acpi_madt_walk() at netbsd:acpi_madt_walk+0x45
mpacpi_scan_apics() at netbsd:mpacpi_scan_apics+0x90
mainbus_attach() at netbsd:mainbus_attach+0x26c
config_attach_loc() at netbsd:config_attach_loc+0x15a
cpu_configure() at netbsd:cpu_configure+0x26
main() at netbsd:main+0x1aa
db{0}> show register
ds  0x5
es  0
fs  0x64
gs  0xbed6
rdi 0xcd
rsi 0x805690a0  est_init_once
rbp 0x8137abb0
rbx 0xe
rdx 0x1
rcx 0xcd
rax 0
r8  0x3
r9  0
r10 0x1
r11 0x802e9da0  comcnputc
r12 0x80c4ea00  cpu_info_primary
r13 0x8137ad40
r14 0x8004b0b5e710
r15 0x1
rip 0x8056e456  rdmsr+0x6
cs  0x8
rflags  0x10246
rsp 0x8137ab98
ss  0x10
netbsd:rdmsr+0x6:   rdmsr



Thank you for your efforts

Regards
Uwe


On Fri, 13 Jan 2012, Patrick Welche wrote:


Date: Fri, 13 Jan 2012 12:34:09 +
From: Patrick Welche 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: kernel crash at ibm x3850

On Fri, Jan 13, 2012 at 11:54:58AM +0100, 6b...@6bone.informatik.uni-leipzig.de 
wrote:

if I boot the netbsd-5-1-1 install media at an ibm x3850 the kernel
crashs. You can find the screenshot at
https://suse.uni-leipzig.de/ibm-x3850.jpg

Any ideas what could be the problem?


Maybe you could get a backtrace (type "bt") at the prompt showing, and
take a picture of that?
(http://www.netbsd.org/docs/kernel/#ddb)

Cheers,

Patrick



Re: kernel crash at ibm x3850

2012-01-15 Thread 6bone

Hello,

sorry for the long delay, but the USB keyboard doesn't work in the ddb 
and I needed some time to configure SOL.


Now the output:

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
2006, 2007, 2008, 2009, 2010
The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California.  All rights reserved.

NetBSD 5.1_STABLE (INSTALL) #4: Sun Jan 15 13:56:06 CET 2012

r...@6bone.informatik.uni-leipzig.de:/usr/obj/sys/arch/amd64/compile/INSTALL
total memory = 511 GB
avail memory = 496 GB
RTC BIOS diagnostic error 80
SMBIOS rev. 2.7 @ 0x7f0be000 (137 entries)
IBM System x3850 X5 -[71453RG]- (06)
mainbus0 (root)
cpu0 at mainbus0 apid 0: Intel 686-class, 1995MHz, id 0x206e6
fatal protection fault in supervisor mode
trap type 4 code 0 rip 80529b06 cs 8 rflags 10246 cr2  0 cpl 8 rsp 
811e0b98

kernel: protection fault trap, code=0
Stopped in pid 0.1 (system) at  0x80529b06: rdmsr
db{0}> bt 
?() at 0x80529b06 
?() at 0x80524948

?() at 0x8047aed2
?() at 0x80517071
?() at 0x80513cc1
?() at 0x8046c2ea
?() at 0x80522cde
?() at 0x80726fe5
?() at 0x80522b90
?() at 0x805594dc
?() at 0x8046c2ea
?() at 0x805294a6
?() at 0x80432bef

Regards
Uwe

 On Fri, 13 Jan 2012, Patrick 
Welche wrote:



Date: Fri, 13 Jan 2012 12:34:09 +
From: Patrick Welche 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: kernel crash at ibm x3850

On Fri, Jan 13, 2012 at 11:54:58AM +0100, 6b...@6bone.informatik.uni-leipzig.de 
wrote:

if I boot the netbsd-5-1-1 install media at an ibm x3850 the kernel
crashs. You can find the screenshot at
https://suse.uni-leipzig.de/ibm-x3850.jpg

Any ideas what could be the problem?


Maybe you could get a backtrace (type "bt") at the prompt showing, and
take a picture of that?
(http://www.netbsd.org/docs/kernel/#ddb)

Cheers,

Patrick



kernel crash at ibm x3850

2012-01-13 Thread 6bone

Hello,

if I boot the netbsd-5-1-1 install media at an ibm x3850 the kernel 
crashs. You can find the screenshot at 
https://suse.uni-leipzig.de/ibm-x3850.jpg


Any ideas what could be the problem?


Thank you for your efforts


Regards
Uwe


netbsd at Dell M710

2011-03-31 Thread 6bone

hello,

some time ago Jean-Yves Migeon has solved some netbsd NIC problems at DELL 
M710 Servers. Unfortunately there are also some other issues with the perc 
controler. So it is not possible to run netbsd at the DELL Server.


Can anyone port the driver for the M710 Server? I can offer access to the 
hardware for some days.



Thank you for your efforts.

Regards
Uwe


Re: netbsd-5.1_RC3 crash at Dell M710

2010-08-15 Thread 6bone

On Sat, 14 Aug 2010, Jean-Yves Migeon wrote:


Date: Sat, 14 Aug 2010 00:29:09 +0200
From: Jean-Yves Migeon 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: netbsd-5.1_RC3 crash at Dell M710

On 14.08.2010 00:05, 6b...@6bone.informatik.uni-leipzig.de wrote:

On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:


Date: Fri, 13 Aug 2010 17:03:14 +0200
From: Jean-Yves Migeon 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: netbsd-5.1_RC3 crash at Dell M710

On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:

hello,

netbsd crashs at Dell M710. You can have a look at the screeshot at
http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp

Any Ideas what could be the problem?


Most probably, an attempt to read a MSR, which is not allowed/present
for that CPU.

At ddb prompt, type "bt" and "show reg", so we can see where and how it
happens.



http://6bone.informatik.uni-leipzig.de/Dell-M710-bt.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-1.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-2.bmp


MSR 0xcd, which is MSR_FSB_FREQ.

One hacky fix is needed. You seem to be in the same situation as mine
there, quick glance at your CPU ID makes me think it reports model 0xc too:

http://cvsweb.netbsd.org/cgi-bin/cvsweb.cgi/src/sys/arch/x86/x86/intel_busclock.c?rev=1.11&content-type=text/x-cvsweb-markup

I asked for a pull-up about a week ago, so should come in eventually.
Try patching around as I did.



the patch solves the problem. unfortunately the kernel now has a problem 
with the broadcom nic.


http://6bone.informatik.uni-leipzig.de/Dell-M710-bnx-no-PHY-found.bmp

does there also a patch exist for this problem?


Thank you for your efforts


Regards
Uwe


Re: netbsd-5.1_RC3 crash at Dell M710

2010-08-13 Thread 6bone

On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:


Date: Fri, 13 Aug 2010 17:03:14 +0200
From: Jean-Yves Migeon 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: netbsd-5.1_RC3 crash at Dell M710

On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:

hello,

netbsd crashs at Dell M710. You can have a look at the screeshot at
http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp

Any Ideas what could be the problem?


Most probably, an attempt to read a MSR, which is not allowed/present
for that CPU.

At ddb prompt, type "bt" and "show reg", so we can see where and how it
happens.



http://6bone.informatik.uni-leipzig.de/Dell-M710-bt.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-1.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-2.bmp


Regards
Uwe


Re: netbsd-5.1_RC3 crash at Dell M710

2010-08-13 Thread 6bone

On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:


Date: Fri, 13 Aug 2010 17:03:14 +0200
From: Jean-Yves Migeon 
To: 6b...@6bone.informatik.uni-leipzig.de
Cc: tech-kern@netbsd.org
Subject: Re: netbsd-5.1_RC3 crash at Dell M710

On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:

hello,

netbsd crashs at Dell M710. You can have a look at the screeshot at
http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp

Any Ideas what could be the problem?


Most probably, an attempt to read a MSR, which is not allowed/present
for that CPU.

At ddb prompt, type "bt" and "show reg", so we can see where and how it
happens.



it is not so easy. my M710 is a server inside a blade chassis. console 
access exists only via dell drac server management. it seems, that the ddb 
does not support the keyboard simulated by dell drac managemend cards.


thank you for your efforts

regards
Uwe


netbsd-5.1_RC3 crash at Dell M710

2010-08-12 Thread 6bone

hello,

netbsd crashs at Dell M710. You can have a look at the screeshot at 
http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp


Any Ideas what could be the problem?


Regards
Uwe