Hi everybody,
this is my first time posting linux-smp mailing list and hope someone could
help me for my problem.
Here at office we are using a RedHat 6.0 distribution (kernel 2.2.5-15 smp)
on a Compaq proliant 5500 server (dual Xeon @ 500 Mhz processor, 512 megs of
RAM, scsi raid controller, intel 10/100 ether card).
We've got a lot of system crashes since installing Smp kernel last november
(at first, we used single processor kernel of the same distribution)
Tipically, the server 'lives' for a couple of days, when several user log
in, suddenly crashes without (apparent) reason.
I read the "Linux SMP HOWTO" and it seems 2.2 kernel should be quite robust
for SMP.
I know about APM problems and APIC settings but could not modify BIOS
settings in the server
What's wrong with it?
Please help me making that server "solid as a rock" !!
I attached last BootMessage.
Thanks in advance
Maurizio Barbaro
e-mail: [EMAIL PROTECTED]
ialized
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.26 (19981001) Richard Gooch ([EMAIL PROTECTED])
per-CPU timeslice cutoff: 200.33 usecs.
CPU3: Intel 00/07 stepping 02
calibrating APIC timer ...
..... CPU clock speed is 499.8463 MHz.
..... system bus clock speed is 99.9691 MHz.
Booting processor 2 eip 2000
Calibrating delay loop... 499.71 BogoMIPS
OK.
CPU2: Intel 00/07 stepping 02
Total of 2 processors activated (997.79 BogoMIPS).
enabling symmetric IO mode... ...done.
ENABLING IO-APIC IRQs
init IO_APIC IRQs
IO-APIC pin 0, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32,
33, 34 not connected.
number of MP IRQ sources: 15.
number of IO-APIC registers: 35.
testing the IO APIC.......................
.... register #00: 00000000
....... : physical APIC id: 00
.... register #01: 00220011
....... : max redirection entries: 0022
WARNING: unexpected IO-APIC, please mail
to [EMAIL PROTECTED]
....... : IO APIC version: 0011
.... register #02: 00000000
....... : arbitration: 00
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 000 00 1 0 0 0 0 0 0 00
01 000 00 0 0 0 0 0 1 1 59
02 0FF 0F 0 0 0 0 0 1 1 51
03 000 00 0 0 0 0 0 1 1 61
04 000 00 0 0 0 0 0 1 1 69
05 0FF 0F 1 1 0 0 0 1 1 71
06 000 00 0 0 0 0 0 1 1 79
07 000 00 0 0 0 0 0 1 1 81
08 000 00 0 0 0 0 0 1 1 89
09 0FF 0F 1 1 0 0 0 1 1 91
0a 0FF 0F 1 1 0 0 0 1 1 99
0b 0FF 0F 1 1 0 0 0 1 1 A1
0c 000 00 0 0 0 0 0 1 1 A9
0d 000 00 1 0 0 0 0 0 0 00
0e 000 00 0 0 0 0 0 1 1 B1
0f 0FF 0F 1 1 0 0 0 1 1 B9
10 000 00 1 0 0 0 0 0 0 00
11 000 00 1 0 0 0 0 0 0 00
12 000 00 1 0 0 0 0 0 0 00
13 000 00 1 0 0 0 0 0 0 00
14 000 00 1 0 0 0 0 0 0 00
15 000 00 1 0 0 0 0 0 0 00
16 000 00 1 0 0 0 0 0 0 00
17 000 00 1 0 0 0 0 0 0 00
18 000 00 1 0 0 0 0 0 0 00
19 000 00 1 0 0 0 0 0 0 00
1a 000 00 1 0 0 0 0 0 0 00
1b 000 00 1 0 0 0 0 0 0 00
1c 000 00 1 0 0 0 0 0 0 00
1d 000 00 1 0 0 0 0 0 0 00
1e 000 00 1 0 0 0 0 0 0 00
1f 000 00 1 0 0 0 0 0 0 00
20 000 00 1 0 0 0 0 0 0 00
21 000 00 1 0 0 0 0 0 0 00
22 000 00 1 0 0 0 0 0 0 00
IRQ to pin mappings:
IRQ0 -> 2
IRQ1 -> 1
IRQ3 -> 3
IRQ4 -> 4
IRQ5 -> 5
IRQ6 -> 6
IRQ7 -> 7
IRQ8 -> 8
IRQ9 -> 9
IRQ10 -> 10
IRQ11 -> 11
IRQ12 -> 12
IRQ13 -> 13
IRQ14 -> 14
IRQ15 -> 15
.................................... done.
mtrr: your CPUs had inconsistent fixed MTRR settings
mtrr: probably your BIOS does not setup all CPUs
PCI: PCI BIOS revision 2.10 entry at 0xf0080
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: 00:80 [8086/84ca]: Scanning peer host bridges
PCI: Device 00:78 not found by BIOS
PCI: Device 00:80 not found by BIOS
PCI: Device 00:90 not found by BIOS
PCI: Device 00:98 not found by BIOS
Linux NET4.0 for Linux 2.2
Based upon Swansea University Computer Society NET3.039
NET4: Unix domain sockets 1.0 for Linux NET4.0.
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
Initializing RT netlink socket
Starting kswapd v 1.5
Detected PS/2 Mouse Port.
Serial driver version 4.27 with MANY_PORTS MULTIPORT SHARE_IRQ enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
pty: 256 Unix98 ptys configured
Real Time Clock Driver v1.09
RAM disk driver initialized: 16 RAM disks of 4096K size
PIIX4: IDE controller on PCI bus 00 dev 79
PIIX4: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0x3000-0x3007, BIOS settings: hda:pio, hdb:pio
hda: CD-ROM CDU701-Q, ATAPI CDROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: ATAPI 14X CD-ROM drive, 128kB Cache
Uniform CDROM driver Revision: 2.54
Floppy drive(s): fd0 is 1.44M
FDC 0 is a National Semiconductor PC87306
md driver 0.90.0 MAX_MD_DEVS=256, MAX_REAL=12
raid5: measuring checksumming speed
raid5: using high-speed MMX checksum routine
pII_mmx : 1194.054 MB/sec
p5_mmx : 1165.479 MB/sec
8regs : 861.822 MB/sec
32regs : 618.363 MB/sec
using fastest function: pII_mmx (1194.054 MB/sec)
scsi : 0 hosts.
scsi : detected total.
md.c: sizeof(mdp_super_t) = 4096
Partition check:
RAMDISK: Compressed image found at block 0
autodetecting RAID arrays
autorun ...
... autorun DONE.
VFS: Mounted root (ext2 filesystem).
ncr53c8xx: at PCI bus 0, device 13, function 0
ncr53c8xx: 53c876 detected
ncr53c8xx: at PCI bus 0, device 13, function 1
ncr53c8xx: 53c876 detected
ncr53c8xx: at PCI bus 4, device 5, function 0
ncr53c8xx: 53c896 detected
ncr53c8xx: at PCI bus 4, device 5, function 1
ncr53c8xx: 53c896 detected
ncr53c876-0: rev=0x14, base=0xc6ce0000, io_port=0x2400, irq=11
ncr53c876-0: NCR clock is 40401KHz, 40218KHz
ncr53c876-0: ID 7, Fast-20, Parity Checking
ncr53c876-0: on-chip RAM at 0xc6cd0000
ncr53c876-0: restart (scsi reset).
ncr53c876-0: Downloading SCSI SCRIPTS.
ncr53c876-1: rev=0x14, base=0xc6cc0000, io_port=0x2800, irq=15
ncr53c876-1: ID 7, Fast-20, Parity Checking
ncr53c876-1: on-chip RAM at 0xc6cb0000
ncr53c876-1: restart (scsi reset).
ncr53c876-1: Downloading SCSI SCRIPTS.
ncr53c896-2: rev=0x05, base=0xc6ef0000, io_port=0x4000, irq=5
ncr53c896-2: NCR clock is 40218KHz, 40218KHz
ncr53c896-2: ID 7, Fast-40, Parity Checking
ncr53c896-2: on-chip RAM at 0xc6ee0000
ncr53c896-2: restart (scsi reset).
ncr53c896-2: Downloading SCSI SCRIPTS.
ncr53c896-3: rev=0x05, base=0xc6ed0000, io_port=0x4400, irq=9
ncr53c896-3: NCR clock is 40218KHz, 40218KHz
ncr53c896-3: ID 7, Fast-40, Parity Checking
ncr53c896-3: on-chip RAM at 0xc6ec0000
ncr53c896-3: restart (scsi reset).
ncr53c896-3: Downloading SCSI SCRIPTS.
scsi0 : ncr53c8xx - revision 3.1h
scsi1 : ncr53c8xx - revision 3.1h
scsi2 : ncr53c8xx - revision 3.1h
scsi3 : ncr53c8xx - revision 3.1h
scsi : 4 hosts.
ncr53c876-1-<6,*>: FAST-10 SCSI 10.0 MB/s (100 ns, offset 16)
Vendor: HP Model: C1533A Rev: A708
Type: Sequential-Access ANSI SCSI revision: 02
Detected scsi tape st0 at scsi1, channel 0, id 6, lun 0
Compaq SMART2 Driver (v 0.9.9)
Found 1 controller(s)
Finding drives on ida0 (SMART-3200)
ida/c0d0: blksz=512 nr_blks=53309280
ida/c0d0: p1 p2 < p5 p6 p7 p8 p9 >
autodetecting RAID arrays
autorun ...
... autorun DONE.
VFS: Mounted root (ext2 filesystem) readonly.
change_root: old root has d_count=1
Trying to unmount old root ... okay
Freeing unused kernel memory: 68k freed
Adding Swap: 130540k swap-space (priority -1)
eth0: OEM i82557/i82558 10/100 Ethernet at 0x2000, 00:50:8B:13:77:53, IRQ 10.
Board assembly 692290-005, Physical connectors present: RJ45
Primary interface chip i82555 PHY #1.
General self-test: passed.
Serial sub-system self-test: passed.
Internal registers self-test: passed.
ROM checksum self-test: passed (0x24c9f043).
Receiver lock-up workaround activated.
Installing knfsd (copyright (C) 1996 [EMAIL PROTECTED]).
nfsd_init: initialized fhcache, entries=256