RE: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-23 Thread Luck, Tony
> The general idea of preemptively poisoning pages which contain deferred > errors is fine though. Agreed. I used to think that it wasn't likely to be very useful because in many cases the UCNA errors are just a trail of breadcrumbs set by different units on the chip as the poison passed through

Re: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-23 Thread Borislav Petkov
On Fri, Oct 10, 2014 at 02:03:40PM +0800, Chen Yucong wrote: > From: Chen Yucong > > dram_ce_error() stems from Boris's patch set. Thanks! > Link: http://lkml.org/lkml/2014/7/1/545 > > Uncorrected no action required (UCNA) - is a UCR error that is not > signaled via a machine check exception

Re: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-23 Thread Borislav Petkov
On Fri, Oct 10, 2014 at 02:03:40PM +0800, Chen Yucong wrote: From: Chen Yucong sla...@gmail.com dram_ce_error() stems from Boris's patch set. Thanks! Link: http://lkml.org/lkml/2014/7/1/545 Uncorrected no action required (UCNA) - is a UCR error that is not signaled via a machine check

RE: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-23 Thread Luck, Tony
The general idea of preemptively poisoning pages which contain deferred errors is fine though. Agreed. I used to think that it wasn't likely to be very useful because in many cases the UCNA errors are just a trail of breadcrumbs set by different units on the chip as the poison passed through on

Re: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-22 Thread Chen Yucong
On Fri, 2014-10-10 at 14:03 +0800, Chen Yucong wrote: > From: Chen Yucong > > dram_ce_error() stems from Boris's patch set. Thanks! > Link: http://lkml.org/lkml/2014/7/1/545 > > Uncorrected no action required (UCNA) - is a UCR error that is not > signaled via a machine check exception and,

Re: [PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-22 Thread Chen Yucong
On Fri, 2014-10-10 at 14:03 +0800, Chen Yucong wrote: From: Chen Yucong sla...@gmail.com dram_ce_error() stems from Boris's patch set. Thanks! Link: http://lkml.org/lkml/2014/7/1/545 Uncorrected no action required (UCNA) - is a UCR error that is not signaled via a machine check exception

[PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-10 Thread Chen Yucong
From: Chen Yucong dram_ce_error() stems from Boris's patch set. Thanks! Link: http://lkml.org/lkml/2014/7/1/545 Uncorrected no action required (UCNA) - is a UCR error that is not signaled via a machine check exception and, instead, is reported to system software as a corrected machine check

[PATCH] x86, MCE: support memory error recovery for both UCNA and Deferred error in machine_check_poll

2014-10-10 Thread Chen Yucong
From: Chen Yucong sla...@gmail.com dram_ce_error() stems from Boris's patch set. Thanks! Link: http://lkml.org/lkml/2014/7/1/545 Uncorrected no action required (UCNA) - is a UCR error that is not signaled via a machine check exception and, instead, is reported to system software as a corrected