Greg Keller <[email protected]> writes:

>> Date: Mon, 25 Jan 2010 10:46:31 -0800
>> From: "David Mathog" <[email protected]>
>> Subject: [Beowulf] Logging MCE information on next warm boot?
>> To: [email protected]
>> Message-ID: <[email protected]>
>> Content-Type: text/plain; charset=iso-8859-1
>>
>> Is it possible to have the Machine Check Exception (MCE) information
>> saved to disk automatically on the next warm boot?
>
> David,
>
> I believe the utility you are looking for is mcelog.  We usually run  it with
> the following arguments:
> /usr/sbin/mcelog -h --ignorenodev --filter
>
> I think it clears the info after it reports it, so make sure to tee it  to a
> file.  I don't understand the command or the flags, just a copy /  paste 
> script
> kiddy in these regards, but I hope it helps.

In the case of a panic this won't work.  You would need to setup kdump or
something like that to capture the panic.

This sounds like L1 or L2 cache corruption but I haven't ever had any
machine checks on anything before the k8 core.  Wow.  You are talking about
old machines.

If machine check registers are kept across reboot there is a reasonable
chance that the firmware clears them.

Eric
_______________________________________________
Beowulf mailing list, [email protected] sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to