Hi Boris, Hi James,

>-----Original Message-----
>From: Borislav Petkov [mailto:[email protected]]
>Sent: 01 October 2020 18:31
>To: James Morse <[email protected]>
>Cc: Shiju Jose <[email protected]>; [email protected]; linux-
>[email protected]; [email protected]; [email protected];
>[email protected]; [email protected]; Linuxarm <[email protected]>
>Subject: Re: [PATCH 1/1] RAS: Add CPU Correctable Error Collector to isolate
>an erroneous CPU core
>
>On Thu, Oct 01, 2020 at 06:16:03PM +0100, James Morse wrote:
>> If the corrected-count is available somewhere, can't this policy be
>> made in user-space?
>
>You mean rasdaemon goes and offlines CPUs when certain thresholds are
>reached? Sure. It would be much more flexible too.

I will send the kernel changes for the existing CEC to support CPU CE errors.
Can you please have a look?

Thanks,
Shiju

>
>--
>Regards/Gruss,
>    Boris.
>
>https://people.kernel.org/tglx/notes-about-netiquette

