On Tue, 3 Oct 2006, Ivan Jager wrote:
On Mon, 2 Oct 2006, Grant Grundler wrote:
On Sun, Oct 01, 2006 at 02:15:42AM -0400, Ivan Jager wrote:
...
Then it will start cycling between:
FLT 500B SYS BD bus timeout
FLT CB74 SYS BD bad os HPMC cksm
FLT CBFC SYS BD OS HPMC br err
FLT CBF0 SYS BD HPMC initiated
To paraphrase, the chassis codes mean:
o Someone accessed an address that didn't respond.
o CPU timed out the transaction.
o HPMC == High Priority machince Check. Try "ser pim" at
the firmware prompt to see which address it tried to reference.
Ok, I'll try that sometime soon.
Heh, I was somehow expecting it to drop me back into a firmware prompt.
After a reboot the bits seemed to still be around, so I attached the
output of "ser pim". This was with the G200, xorg 7.1.0-1, on the c3700.
Nothing from Xorg.0.log survived. :(
I don't actually know what most of these bits mean. Is
MEM_ADDR = 0x000001ff3fffffff
the physical address it tried to access?
This may be relevant:
$ cat /proc/ioports
00000000-00001fff : PCI00 Ports
00000020-0000003e : pic1
000000a0-000000be : pic2
000003c0-000003df : matrox
000007e0-000007fe : acpi
00000800-000008ff : sym53c8xx
00000900-000009ff : sym53c8xx
00000a00-00000a07 : ide0
00000a08-00000a0f : ide1
00000e02-00000e02 : ide0
00000f00-00000f07 : ide0
00001000-0000107f : tulip
00012000-00013fff : PCI01 Ports
00028000-00029fff : PCI02 Ports
0003c000-0003dfff : PCI03 Ports
[EMAIL PROTECTED]:~$ cat /proc/iomem
00000000-bfffffff : System RAM
00000000-000009ff : PDC data (Page Zero)
00100000-004c1fff : Kernel code
004c2000-006cefff : Kernel data
fffffff0f05d0000-fffffff0f05d0000 : lcd_data
fffffff0f05d0008-fffffff0f05d0008 : lcd_cmd
fffffffff4000000-fffffffff47fffff : PCI00 LMMIO
fffffffff4000000-fffffffff4001fff : sym53c8xx
fffffffff4002000-fffffffff4003fff : sym53c8xx
fffffffff4004000-fffffffff40043ff : sym53c8xx
fffffffff4005000-fffffffff40053ff : sym53c8xx
fffffffff4007000-fffffffff4007fff : ohci_hcd
fffffffff4008000-fffffffff40083ff : tulip
fffffffff4009000-fffffffff400900f : AD1889
fffffffff400a000-fffffffff400a00f : AD1889
fffffffff400b000-fffffffff400b00f : AD1889
fffffffff400c000-fffffffff400c1ff : AD1889
fffffffff4800000-fffffffff4ffffff : PCI01 LMMIO
fffffffff6000000-fffffffff67fffff : PCI02 LMMIO
fffffffff7000000-fffffffff77fffff : PCI03 LMMIO
fffffffff8000000-fffffffff8ffffff : PCI01 ELMMIO
fffffffff8000000-fffffffff8ffffff : matroxfb FB
fffffffff9000000-fffffffff9003fff : matroxfb MMIO
fffffffffa000000-fffffffffbffffff : PCI03 ELMMIO
fffffffffed00000-fffffffffed00fff : 10
fffffffffed30000-fffffffffed30fff : 10:0
fffffffffed32000-fffffffffed32fff : 10:1
fffffffffed38000-fffffffffed38fff : 10:4
fffffffffed3c000-fffffffffed3cfff : 10:6
fffffffffef00000-fffffffffeffffff : Astro Intr Ack
fffffffffff80000-fffffffffffaffff : Central Bus
fffffffffffa0000-fffffffffffa0fff : 32
fffffffffffb0000-fffffffffffdffff : Local Broadcast
fffffffffffe0000-ffffffffffffffff : Global Broadcast
Ivan
Main Menu: Enter command > ser pim
PROCESSOR PIM INFORMATION
----------------- Processor 0 HPMC Information ------------------
Timestamp =
Wed Oct 4 02:31:42 GMT 2006 (20:06:10:04:02:31:42)
HPMC Chassis Codes = 2cbf0 2500b 2cbf4 2cbfc
General Registers 0 - 31
00-03 0000000000000000 0000000040ac37c0 0000000040ac373b 0000000040017ff0
04-07 00000000001c98b8 00000000c0650760 00000000c0650760 0000000000000040
08-11 0000000000000007 000000000019a0f4 00000000f9010000 0000000000000000
12-15 0000000000000000 0000000000000001 0000000000000004 00000000c0650760
16-19 0000000000010000 0000000000000055 00000000000000aa 0000000040b939bc
20-23 0000000000000000 0000000040ac36bc 0000000000000007 0000000000000001
24-27 0000000000000010 0000000040017ff0 00000000c065075c 00000000001c98b8
28-31 0000000040ac3844 0000000000000151 00000000c06509c0 0000000000000000
<Press any key to continue (q to quit)>
Control Registers 0 - 31
00-03 0000000000000000 0000000000000000 0000000000000000 0000000000000000
04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
08-11 000000000000102c 0000000000000000 00000000000000c0 0000000000000008
12-15 0000000000000000 0000000000000000 0000000000103000 ff00000000000000
16-19 0000001a63539900 000000000040b000 0000000040ac384f 000000000f201094
20-23 00000000002403ff 40000000ffc17ff0 000000ff000eff0f 8000000000000000
24-27 00000000005c8000 00000000bd33a000 00000000ffffffff 00000000ffffffff
28-31 00000000ff7ffffe 00000000ff7ffffe 00000000cd24c000 0000000010620000
Space Registers 0 - 7
00-03 0040b000 0040b000 00000000 0040b000
04-07 0040b000 0040b000 0040b000 0040b000
<Press any key to continue (q to quit)>
IIA Space = 0x000000000040b000
IIA Offset = 0x0000000040ac37f7
Check Type = 0x20000000
CPU State = 0x9e000004
Cache Check = 0x00000000
TLB Check = 0x00000000
Bus Check = 0x003010bb
Assists Check = 0x00000000
Assist State = 0x00000000
Path Info = 0x00031800
System Responder Address = 0xfffffffffed10200
System Requestor Address = 0xfffffffffffa0000
Floating-Point Registers 0 - 31
00-03 0000001f00000000 0000000000000000 0000000000000000 0000000000000000
04-07 4084d00000000000 0000000000000000 000000001066fb60 0000000000000000
08-11 fffffffffffffc18 0000000000000000 0000000000000802 0000000013c36d10
12-15 c065076013c36d00 000100001066fb60 0000000000000000 fffffffffffffff4
16-19 000000001019d838 00000000106a9b18 00000000101386fc 000000001066fb60
20-23 00000000101882ec 0000000010685b60 0000000000010000 0000001000000000
24-27 0000100000000000 0000000000000000 412e848000000000 0000000013c44108
28-31 fffffffffffffc18 0000000000000000 0000000000000802 0000000013c31000
<Press any key to continue (q to quit)>
'9000/785 B,C,J Workstation Unarchitected (per-CPU)', rev 1, 140 bytes:
Check Summary = 0xcb81041000000000
Available Memory = 0x00000000c0000000
CPU Diagnose Register 2 = 0x0203000000802004
CPU Status Register 0 = 0x2020c20000000000
CPU Status Register 1 = 0x8080000000000000
SADD LOG = 0x5306006000000001
Read Short LOG = 0xc1a0f0f0f0400804
ERROR_STATUS = 0x0000000000000010
MEM_ADDR = 0x000001ff3fffffff
MEM_SYND = 0x0000000000000000
MEM_ADDR_CORR = 0x000001ff3fffffff
MEM_SYND_CORR = 0x0000000000000000
RUN_DATA_HIGH = 0xc1bff0fffed08040
RUN_DATA_LOW = 0xc1bff0fffed08040
RUN_CTRL = 0x0000021c00002a1c
RUN_ADDR = 0xc1bff0fffed08040
System Responder Path = 0x00ffffffffffffff
HPMC PIM Analysis Information:
Timestamp =
Wed Oct 4 02:31:42 GMT 2006 (20:06:10:04:02:31:42)
'9000/785 B,C,J Workstation HPMC PIM Analysis (per-CPU)', rev 0, 1304 bytes:
A Data Miss Timeout occurred while CPU 0 was
requesting information.
Memory/IO Controller Error Analysis Information:
The Memory/IO Controller only observed the Broadcast Error. It did not log
any additional information about the HPMC.
<Press any key to continue (q to quit)>
----------------- Processor 0 LPMC Information ------------------
Check Type = 0x00000000
I/D Cache Parity Info = 0x00000000
Cache Check = 0x00000000
TLB Check = 0x00000000
Bus Check = 0x00000000
Assists Check = 0x00000000
Assist State = 0x00000000
Path Info = 0x00000000
System Responder Address = 0x0000000000000000
System Requestor Address = 0x0000000000000000
----------------- Processor 0 TOC Information -------------------
General Registers 0 - 31
00-03 0000000000000000 0000000000000000 0000000000000000 0000000000000000
04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
08-11 0000000000000000 0000000000000000 0000000000000000 0000000000000000
12-15 0000000000000000 0000000000000000 0000000000000000 0000000000000000
16-19 0000000000000000 0000000000000000 0000000000000000 0000000000000000
20-23 0000000000000000 0000000000000000 0000000000000000 0000000000000000
24-27 0000000000000000 0000000000000000 0000000000000000 0000000000000000
28-31 0000000000000000 0000000000000000 0000000000000000 0000000000000000
<Press any key to continue (q to quit)>
Control Registers 0 - 31
00-03 0000000000000000 0000000000000000 0000000000000000 0000000000000000
04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
08-11 0000000000000000 0000000000000000 0000000000000000 0000000000000000
12-15 0000000000000000 0000000000000000 0000000000000000 0000000000000000
16-19 0000000000000000 0000000000000000 0000000000000000 0000000000000000
20-23 0000000000000000 0000000000000000 0000000000000000 0000000000000000
24-27 0000000000000000 0000000000000000 0000000000000000 0000000000000000
28-31 0000000000000000 0000000000000000 0000000000000000 0000000000000000
Space Registers 0 - 7
00-03 00000000 00000000 00000000 00000000
04-07 00000000 00000000 00000000 00000000
IIA Space = 0x0000000000000000
IIA Offset = 0x0000000000000000
CPU State = 0x00000000
<Press any key to continue (q to quit)>
Memory Error Log Information:
Timestamp =
Wed Oct 4 02:31:42 GMT 2006 (20:06:10:04:02:31:42)
'9000/785 B,C,J Workstation Memory Error Log', rev 0, 64 bytes:
No memory errors logged
I/O Module Error Log Information:
Timestamp =
Wed Oct 4 02:31:42 GMT 2006 (20:06:10:04:02:31:42)
'9000/785 B,C,J Workstation IO Error Log', rev 0, 228 bytes:
Rope Word1 Word2 Word3
------ ------------ ------------
0 0x00000000 0x0e0cc009 0x00000000fed30048
1 0x00000000 0x1e0cc009 0x00000000fed32048
2 ---------- 0x2e0cc009 ------------------
3 ---------- 0x3e0cc009 ------------------
4 0x00000000 0x4e0cc009 0x00000000fed38048
5 ---------- 0x5e0cc009 ------------------
6 0x00000000 0x6e0cc009 0x00000000fed3c048
7 ---------- 0x7e0cc009 ------------------
Main Menu: Enter command >