Help diagnosing system hangs?

2005-08-23 Thread Eric Rescorla
I've recently been experiencing frequent hangs with FreeBSD 5.3-RELEASE-p20.
Strangely, this didn't appear to happen with 5.3-RELEASE or before
a few months ago.

My platform is a P4-2.8 GHz (dmesg appended at end).

The behavior is that X freezes (sorry, I can't say if the clock is still
working because I forgot to check) and sshing into the box hangs.  A
hard reboot clears things and the box starts normally, except that it
fscks as expected. Nothing appears in /var/log/messages, etc.

Because this problem started fairly recently, my one guess is that this
is somehow related to the hyperthreading fix in SA-05:09. This processor
allegedly has hyperthreading but I'm running GENERIC. Any possibility
this could be responsible for hanging somehow?

If this isn't a likely culprit, I'm assuming that the next step is to
try to debug the hang with DDB. There's one added difficulty here which
is that I'm using a Matrox P650 and for some reason the X drivers are
screwy so that once you've entered X any attempt to use the virtual
consoles just gets you a blank screen. So, while I've rebuilt the kernel
with DDB, I suspect that I won't actually be able to force the system
console into debugging mode. I assume that this means I need a serial
console but based on the Handbook serial console section it looks like
I can't use a serial console along with the normal console, and since
I want to use X, that would be bad. Am I misreading this?

Any help FreeBSDers can offer would be much appreciated.

Best,
-Ekr


Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.3-RELEASE-p20 #0: Wed Jul 27 05:56:48 PDT 2005
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC
ACPI APIC Table: A M I  OEMAPIC 
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Pentium(R) 4 CPU 2.80GHz (2798.67-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf25  Stepping = 5
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Hyperthreading: 2 logical CPUs
real memory  = 1072889856 (1023 MB)
avail memory = 1040326656 (992 MB)
ioapic0 Version 2.0 irqs 0-23 on motherboard
npx0: [FAST]
npx0: math processor on motherboard
npx0: INT 16 interface
acpi0: A M I OEMRSDT on motherboard
acpi0: Power Button (fixed)
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU port 0x530-0x537 on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
agp0: Intel 82875P host to AGP bridge mem 0xf400-0xf7ff at device 0.0 
on pci0
pcib1: ACPI PCI-PCI bridge at device 1.0 on pci0
pcib1: could not get PCI interrupt routing table for \\_SB_.PCI0.P0P1 - 
AE_NOT_FOUND
pci1: ACPI PCI bus on pcib1
pci1: display, VGA at device 0.0 (no driver attached)
pcib2: ACPI PCI-PCI bridge at device 3.0 on pci0
pci2: ACPI PCI bus on pcib2
em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
0xcf80-0xcf9f mem 0xfe9e-0xfe9f irq 18 at device 1.0 on pci2
em0: Ethernet address: 00:0e:a6:1c:7e:4d
em0:  Speed:N/A  Duplex:N/A
uhci0: Intel 82801EB (ICH5) USB controller USB-A port 0xef00-0xef1f irq 16 at 
device 29.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: Intel 82801EB (ICH5) USB controller USB-A on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: Intel 82801EB (ICH5) USB controller USB-B port 0xef20-0xef3f irq 19 at 
device 29.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: Intel 82801EB (ICH5) USB controller USB-B on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: Intel 82801EB (ICH5) USB controller USB-C port 0xef40-0xef5f irq 18 at 
device 29.2 on pci0
uhci2: [GIANT-LOCKED]
usb2: Intel 82801EB (ICH5) USB controller USB-C on uhci2
usb2: USB revision 1.0
uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3: Intel 82801EB (ICH5) USB controller USB-D port 0xef80-0xef9f irq 16 at 
device 29.3 on pci0
uhci3: [GIANT-LOCKED]
usb3: Intel 82801EB (ICH5) USB controller USB-D on uhci3
usb3: USB revision 1.0
uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
pci0: serial bus, USB at device 29.7 (no driver attached)
pcib3: ACPI PCI-PCI bridge at device 30.0 on pci0
pci3: ACPI PCI bus on pcib3
fwohci0: VIA Fire II (VT6306) port 0xdc00-0xdc7f mem 0xfeaff800-0xfeaf 
irq 20 at device 3.0 on pci3
fwohci0: OHCI version 1.0 (ROM=1)
fwohci0: No. of Isochronous channels is 8.
fwohci0: EUI64 00:e0:18:00:00:4b:c3:e5
fwohci0: Phy 1394a available S400, 3 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: IEEE1394(FireWire) bus on fwohci0

Re: Help diagnosing system hangs?

2005-08-23 Thread Nikolas Britton
On 8/23/05, Eric Rescorla [EMAIL PROTECTED] wrote:
 I've recently been experiencing frequent hangs with FreeBSD 5.3-RELEASE-p20.
 Strangely, this didn't appear to happen with 5.3-RELEASE or before
 a few months ago.
 
 My platform is a P4-2.8 GHz (dmesg appended at end).
 
 The behavior is that X freezes (sorry, I can't say if the clock is still
 working because I forgot to check) and sshing into the box hangs.  A
 hard reboot clears things and the box starts normally, except that it
 fscks as expected. Nothing appears in /var/log/messages, etc.
 
 Because this problem started fairly recently, my one guess is that this
 is somehow related to the hyperthreading fix in SA-05:09. This processor
 allegedly has hyperthreading but I'm running GENERIC. Any possibility
 this could be responsible for hanging somehow?

Have you ruled out hardware problems, what makes you think it's
FreeBSD? Is this problem random? etc.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Help diagnosing system hangs?

2005-08-23 Thread Eric Rescorla
Nikolas Britton [EMAIL PROTECTED] wrote:
 On 8/23/05, Eric Rescorla [EMAIL PROTECTED] wrote:
  I've recently been experiencing frequent hangs with FreeBSD 5.3-RELEASE-p20.
  Strangely, this didn't appear to happen with 5.3-RELEASE or before
  a few months ago.
  
  My platform is a P4-2.8 GHz (dmesg appended at end).
  
  The behavior is that X freezes (sorry, I can't say if the clock is still
  working because I forgot to check) and sshing into the box hangs.  A
  hard reboot clears things and the box starts normally, except that it
  fscks as expected. Nothing appears in /var/log/messages, etc.
  
  Because this problem started fairly recently, my one guess is that this
  is somehow related to the hyperthreading fix in SA-05:09. This processor
  allegedly has hyperthreading but I'm running GENERIC. Any possibility
  this could be responsible for hanging somehow?
 
 Have you ruled out hardware problems, what makes you think it's
 FreeBSD? Is this problem random? etc.

(1) No, I haven't ruled out hardware problems. However, I have
a nearly identical machine that started acting up in the
same way in the same time frame, which is why I'm suspecting
the OS. Meant to mention this but got bogged down in detail. Sorry!

(2) It does appear to be random, yes.

Thanks,
-Ekr
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Help diagnosing system hangs?

2005-08-23 Thread Mike Tancsa
On Mon, 22 Aug 2005 23:50:36 -0700, in sentex.lists.freebsd.questions
you wrote:

I've recently been experiencing frequent hangs with FreeBSD 5.3-RELEASE-p20.
Strangely, this didn't appear to happen with 5.3-RELEASE or before
a few months ago.


There are a lot of bugs that have been fixed since  5.3-p20. So even
if you dropped into the debugger and found what was going on, chances
are good that its a bug that has already been fixed. I would go to
RELENG_5 if you can, or even RELENG_6 if you wait another week or two.

---Mike

Mike Tancsa, Sentex communications http://www.sentex.net
Providing Internet Access since 1994
[EMAIL PROTECTED], (http://www.tancsa.com)
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]