Re: increase softint_bytes
On Wed, 22 Nov 2017, Masanobu SAITOH wrote:

> Date: Wed, 22 Nov 2017 11:41:59 +0900
> From: Masanobu SAITOH
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: msai...@execsw.org, tech-kern@NetBSD.org
> Subject: Re: increase softint_bytes
>
> On 2017/11/21 15:35, Masanobu SAITOH wrote:
>> On 2017/11/20 17:28, Masanobu SAITOH wrote:
>>> On 2017/11/17 18:42, 6b...@6bone.informatik.uni-leipzig.de wrote:
>>>> On Thu, 16 Nov 2017, Masanobu SAITOH wrote:
>>>>> Hi, all.
>>>>>
>>>>> Some device drivers now allocate a lot of softints. See:
>>>>>
>>>>> http://mail-index.netbsd.org/current-users/2017/11/09/msg032581.html
>>>>>
>>>>> To avoid this panic, I wrote the following patch:
>>>>>
>>>>> http://www.netbsd.org/~msaitoh/softint-20171116-0.dif
>>>>
>>>> I tested the patch. Now the dump comes in another place.
>>>>
>>>> https://suse.uni-leipzig.de/crash/crash-with-patch.jpg
>>>>
>>>> Regards
>>>> Uwe
>>>
>>> Could you test the following patch?
>>>
>>> http://www.netbsd.org/~msaitoh/vlan-20171120-0.dif
>>
>> Updated patch
>>
>> http://www.netbsd.org/~msaitoh/vlan-20171121-0.dif
>>
>> Fix compile error (sorry)
>> Revert if_wmreg.h 1.104 and if_wm.c 1.542
>
> Committed in -current.

-current now boots without error on my server.

Regards
Uwe
Re: increase softint_bytes
On Thu, 16 Nov 2017, Masanobu SAITOH wrote:

> Hi, all.
>
> Some device drivers now allocate a lot of softints. See:
>
> http://mail-index.netbsd.org/current-users/2017/11/09/msg032581.html
>
> To avoid this panic, I wrote the following patch:
>
> http://www.netbsd.org/~msaitoh/softint-20171116-0.dif

I tested the patch. Now the dump comes in another place.

https://suse.uni-leipzig.de/crash/crash-with-patch.jpg

Regards
Uwe
Re: PR/49328 CVS commit: src/sys/dev/pci/ixgbe
On Wed, 4 Feb 2015, Christos Zoulas wrote:

> ...
> christos

I have tested NetBSD 7.99.x. The problem is now solved. The PR can be closed.

Regards
Uwe
frozen netbsd-6-rc4
Hello,

I am using netbsd-6-rc4 on a Dell 1850. The server works only as a router. It is connected by an LACP port-channel (4*1gb) to a Cisco Catalyst switch.

Sometimes (once a day or more often) the system CPU usage increases to 300 percent or more. In some cases NetBSD freezes (including the kernel debugger); in other cases it returns to normal operation after some minutes.

The top command shows the following:

load averages: 0.76, 0.77, 0.58; up 4+03:49:27 11:31:11
33 processes: 31 sleeping, 2 on CPU
CPU0 states: 6.2% user, 0.0% nice, 18.8% system, 11.2% interrupt, 63.9% idle
CPU1 states: 5.0% user, 0.0% nice, 12.8% system, 3.6% interrupt, 78.6% idle
Memory: 279M Act, 153M Inact, 6036K Wired, 20M Exec, 152M File, 296M Free
Swap: 4506M Total, 4506M Free

  PID USERNAME PRI NICE   SIZE   RES STATE      TIME   WCPU    CPU COMMAND
    0 root       0    0     0K 6928K CPU/1     13:37  0.00%   277% [system]
  464 root      43    0   316M  238M parked/0  21.4H 32.47% 32.47% named
.

Any idea what the problem could be? The hardware is OK and the switch does not show any abnormal network traffic.

Thank you for your efforts

Regards
Uwe
Re: kernel crash at ibm x3850
Hello,

I compiled a kernel with some more debug code.

kernel text is mapped with 6 large pages and 34 normal pages
Loaded initial symtab at 0x81258fa0, strtab at 0x81303f70, # entries 29097
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010
    The NetBSD Foundation, Inc. All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California. All rights reserved.

NetBSD 5.1_STABLE (INSTALL) #7: Sun Jan 15 23:33:41 CET 2012
    r...@6bone.informatik.uni-leipzig.de:/usr/obj/sys/arch/amd64/compile/INSTALL
total memory = 511 GB
avail memory = 496 GB
RTC BIOS diagnostic error 80
SMBIOS rev. 2.7 @ 0x7f0be000 (137 entries)
IBM System x3850 X5 -[71453RG]- (06)
mainbus0 (root)
cpu0 at mainbus0 apid 0: Intel 686-class, 1995MHz, id 0x206e6
fatal protection fault in supervisor mode
trap type 4 code 0 rip 8056e456 cs 8 rflags 10246 cr2 0 cpl 8 rsp 8137ab98
kernel: protection fault trap, code=0
Stopped in pid 0.1 (system) at netbsd:rdmsr+0x6: rdmsr
db{0}> bt
rdmsr() at netbsd:rdmsr+0x6
est_init_once() at netbsd:est_init_once+0x148
_run_once() at netbsd:_run_once+0x67
cpu_identify() at netbsd:cpu_identify+0x171
cpu_attach() at netbsd:cpu_attach+0x21f
config_attach_loc() at netbsd:config_attach_loc+0x15a
mpacpi_config_cpu() at netbsd:mpacpi_config_cpu+0x6e
acpi_madt_walk() at netbsd:acpi_madt_walk+0x45
mpacpi_scan_apics() at netbsd:mpacpi_scan_apics+0x90
mainbus_attach() at netbsd:mainbus_attach+0x26c
config_attach_loc() at netbsd:config_attach_loc+0x15a
cpu_configure() at netbsd:cpu_configure+0x26
main() at netbsd:main+0x1aa
db{0}> show register
ds      0x5
es      0
fs      0x64
gs      0xbed6
rdi     0xcd
rsi     0x805690a0 est_init_once
rbp     0x8137abb0
rbx     0xe
rdx     0x1
rcx     0xcd
rax     0
r8      0x3
r9      0
r10     0x1
r11     0x802e9da0 comcnputc
r12     0x80c4ea00 cpu_info_primary
r13     0x8137ad40
r14     0x8004b0b5e710
r15     0x1
rip     0x8056e456 rdmsr+0x6
cs      0x8
rflags  0x10246
rsp     0x8137ab98
ss      0x10
netbsd:rdmsr+0x6: rdmsr

Thank you for your efforts

Regards
Uwe

On Fri, 13 Jan 2012, Patrick Welche wrote:

> Date: Fri, 13 Jan 2012 12:34:09 +
> From: Patrick Welche
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: tech-kern@netbsd.org
> Subject: Re: kernel crash at ibm x3850
>
> On Fri, Jan 13, 2012 at 11:54:58AM +0100, 6b...@6bone.informatik.uni-leipzig.de wrote:
>> if I boot the netbsd-5-1-1 install media on an IBM x3850, the kernel crashes.
>> You can find the screenshot at
>>
>> https://suse.uni-leipzig.de/ibm-x3850.jpg
>>
>> Any ideas what could be the problem?
>
> Maybe you could get a backtrace (type "bt") at the prompt showing, and
> take a picture of that?
>
> (http://www.netbsd.org/docs/kernel/#ddb)
>
> Cheers,
>
> Patrick
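As an aid to reading the trace above, here is a minimal Python sketch that decodes the two numbers that matter in it: the CPU signature from the cpu0 attach line (id 0x206e6) and the value in rcx at the fault, which is the index that rdmsr reads. The helper names are made up for illustration, and the MSR name is taken from the Dell M710 thread in this archive, where MSR 0xcd is identified as MSR_FSB_FREQ. The bit layout follows the usual CPUID leaf 1 convention.

```python
# Values copied from the ddb output above; helper names are hypothetical.
CPU_SIGNATURE = 0x206E6  # "id 0x206e6" from the cpu0 attach line
RCX_AT_FAULT = 0xCD      # rcx in "show register"; rdmsr takes its index from ecx

# Only the one MSR named in this archive is listed here.
KNOWN_MSRS = {0xCD: "MSR_FSB_FREQ"}


def decode_signature(sig):
    """Split a CPUID leaf 1 EAX value into family, model and stepping."""
    stepping = sig & 0xF
    base_model = (sig >> 4) & 0xF
    base_family = (sig >> 8) & 0xF
    ext_model = (sig >> 16) & 0xF
    ext_family = (sig >> 20) & 0xFF

    # The extended family is only added when the base family is 0xF.
    family = base_family + ext_family if base_family == 0xF else base_family
    # The extended model only counts for base families 0x6 and 0xF.
    if base_family in (0x6, 0xF):
        model = (ext_model << 4) | base_model
    else:
        model = base_model
    return {
        "family": family,
        "model": model,
        "base_model": base_model,
        "stepping": stepping,
    }


if __name__ == "__main__":
    f = decode_signature(CPU_SIGNATURE)
    print("family 0x%x, model 0x%x (low nibble 0x%x), stepping 0x%x"
          % (f["family"], f["model"], f["base_model"], f["stepping"]))
    print("faulting rdmsr index 0x%x = %s"
          % (RCX_AT_FAULT, KNOWN_MSRS.get(RCX_AT_FAULT, "unknown")))
```

For this signature the full model is 0x2e, while the low nibble alone is 0xe. Code that compares only the low nibble would treat this CPU like an older family 6 model 0xe part; whether that is what happens in est_init_once here is an assumption, not something the trace proves.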
Re: kernel crash at ibm x3850
Hello,

sorry for the long delay, but the USB keyboard doesn't work in ddb and I needed some time to configure SOL. Now the output:

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010
    The NetBSD Foundation, Inc. All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California. All rights reserved.

NetBSD 5.1_STABLE (INSTALL) #4: Sun Jan 15 13:56:06 CET 2012
    r...@6bone.informatik.uni-leipzig.de:/usr/obj/sys/arch/amd64/compile/INSTALL
total memory = 511 GB
avail memory = 496 GB
RTC BIOS diagnostic error 80
SMBIOS rev. 2.7 @ 0x7f0be000 (137 entries)
IBM System x3850 X5 -[71453RG]- (06)
mainbus0 (root)
cpu0 at mainbus0 apid 0: Intel 686-class, 1995MHz, id 0x206e6
fatal protection fault in supervisor mode
trap type 4 code 0 rip 80529b06 cs 8 rflags 10246 cr2 0 cpl 8 rsp 811e0b98
kernel: protection fault trap, code=0
Stopped in pid 0.1 (system) at 0x80529b06: rdmsr
db{0}> bt
?() at 0x80529b06
?() at 0x80524948
?() at 0x8047aed2
?() at 0x80517071
?() at 0x80513cc1
?() at 0x8046c2ea
?() at 0x80522cde
?() at 0x80726fe5
?() at 0x80522b90
?() at 0x805594dc
?() at 0x8046c2ea
?() at 0x805294a6
?() at 0x80432bef

Regards
Uwe

On Fri, 13 Jan 2012, Patrick Welche wrote:

> Date: Fri, 13 Jan 2012 12:34:09 +
> From: Patrick Welche
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: tech-kern@netbsd.org
> Subject: Re: kernel crash at ibm x3850
>
> On Fri, Jan 13, 2012 at 11:54:58AM +0100, 6b...@6bone.informatik.uni-leipzig.de wrote:
>> if I boot the netbsd-5-1-1 install media on an IBM x3850, the kernel crashes.
>> You can find the screenshot at
>>
>> https://suse.uni-leipzig.de/ibm-x3850.jpg
>>
>> Any ideas what could be the problem?
>
> Maybe you could get a backtrace (type "bt") at the prompt showing, and
> take a picture of that?
>
> (http://www.netbsd.org/docs/kernel/#ddb)
>
> Cheers,
>
> Patrick
kernel crash at ibm x3850
Hello,

if I boot the netbsd-5-1-1 install media on an IBM x3850, the kernel crashes. You can find the screenshot at

https://suse.uni-leipzig.de/ibm-x3850.jpg

Any ideas what could be the problem?

Thank you for your efforts

Regards
Uwe
netbsd at Dell M710
Hello,

some time ago Jean-Yves Migeon solved some NetBSD NIC problems on Dell M710 servers. Unfortunately there are also other issues with the PERC controller, so it is not possible to run NetBSD on the Dell server.

Can anyone port the driver for the M710 server? I can offer access to the hardware for some days.

Thank you for your efforts.

Regards
Uwe
Re: netbsd-5.1_RC3 crash at Dell M710
On Sat, 14 Aug 2010, Jean-Yves Migeon wrote:

> Date: Sat, 14 Aug 2010 00:29:09 +0200
> From: Jean-Yves Migeon
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: tech-kern@netbsd.org
> Subject: Re: netbsd-5.1_RC3 crash at Dell M710
>
> On 14.08.2010 00:05, 6b...@6bone.informatik.uni-leipzig.de wrote:
>> On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:
>>> Date: Fri, 13 Aug 2010 17:03:14 +0200
>>> From: Jean-Yves Migeon
>>> To: 6b...@6bone.informatik.uni-leipzig.de
>>> Cc: tech-kern@netbsd.org
>>> Subject: Re: netbsd-5.1_RC3 crash at Dell M710
>>>
>>> On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:
>>>> hello,
>>>>
>>>> netbsd crashes on the Dell M710. You can have a look at the screenshot at
>>>>
>>>> http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp
>>>>
>>>> Any ideas what could be the problem?
>>>
>>> Most probably, an attempt to read an MSR which is not allowed/present
>>> for that CPU. At the ddb prompt, type "bt" and "show reg", so we can see
>>> where and how it happens.
>>
>> http://6bone.informatik.uni-leipzig.de/Dell-M710-bt.bmp
>> http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-1.bmp
>> http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-2.bmp
>
> MSR 0xcd, which is MSR_FSB_FREQ.
>
> One hacky fix is needed. You seem to be in the same situation as mine
> there; a quick glance at your CPU ID makes me think it reports model 0xc too:
>
> http://cvsweb.netbsd.org/cgi-bin/cvsweb.cgi/src/sys/arch/x86/x86/intel_busclock.c?rev=1.11&content-type=text/x-cvsweb-markup
>
> I asked for a pull-up about a week ago, so it should come in eventually. Try
> patching around as I did.

The patch solves the problem. Unfortunately the kernel now has a problem with the Broadcom NIC:

http://6bone.informatik.uni-leipzig.de/Dell-M710-bnx-no-PHY-found.bmp

Is there also a patch for this problem?

Thank you for your efforts

Regards
Uwe
Re: netbsd-5.1_RC3 crash at Dell M710
On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:

> Date: Fri, 13 Aug 2010 17:03:14 +0200
> From: Jean-Yves Migeon
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: tech-kern@netbsd.org
> Subject: Re: netbsd-5.1_RC3 crash at Dell M710
>
> On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:
>> hello,
>>
>> netbsd crashes on the Dell M710. You can have a look at the screenshot at
>>
>> http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp
>>
>> Any ideas what could be the problem?
>
> Most probably, an attempt to read an MSR which is not allowed/present
> for that CPU. At the ddb prompt, type "bt" and "show reg", so we can see
> where and how it happens.

http://6bone.informatik.uni-leipzig.de/Dell-M710-bt.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-1.bmp
http://6bone.informatik.uni-leipzig.de/Dell-M710-show-reg-2.bmp

Regards
Uwe
Re: netbsd-5.1_RC3 crash at Dell M710
On Fri, 13 Aug 2010, Jean-Yves Migeon wrote:

> Date: Fri, 13 Aug 2010 17:03:14 +0200
> From: Jean-Yves Migeon
> To: 6b...@6bone.informatik.uni-leipzig.de
> Cc: tech-kern@netbsd.org
> Subject: Re: netbsd-5.1_RC3 crash at Dell M710
>
> On 13.08.2010 08:52, 6b...@6bone.informatik.uni-leipzig.de wrote:
>> hello,
>>
>> netbsd crashes on the Dell M710. You can have a look at the screenshot at
>>
>> http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp
>>
>> Any ideas what could be the problem?
>
> Most probably, an attempt to read an MSR which is not allowed/present
> for that CPU. At the ddb prompt, type "bt" and "show reg", so we can see
> where and how it happens.

It is not so easy. My M710 is a server inside a blade chassis. Console access exists only via Dell DRAC server management. It seems that ddb does not support the keyboard simulated by the Dell DRAC management cards.

Thank you for your efforts

Regards
Uwe
netbsd-5.1_RC3 crash at Dell M710
Hello,

netbsd crashes on the Dell M710. You can have a look at the screenshot at

http://6bone.informatik.uni-leipzig.de/Dell-M710.bmp

Any ideas what could be the problem?

Regards
Uwe