Re: [RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-19 Thread William Roche
Hello David, I hope my last week email answered your interrogations about:     - retrieving the valid data from the lost hugepage     - the need of smaller pages to replace a failed large page     - the interaction of memory error and VM migration     - the non-symmetrical access to a poisoned me

Re: [RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-12 Thread William Roche
On 9/12/24 00:07, David Hildenbrand wrote: Hi again, This is a Qemu RFC to introduce the possibility to deal with hardware memory errors impacting hugetlbfs memory backed VMs. When using hugetlbfs large pages, any large page location being impacted by an HW memory error results in poisoning th

Re: [RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-11 Thread David Hildenbrand
Hi again, This is a Qemu RFC to introduce the possibility to deal with hardware memory errors impacting hugetlbfs memory backed VMs. When using hugetlbfs large pages, any large page location being impacted by an HW memory error results in poisoning the entire page, suddenly making a large chunk

Re: [RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-10 Thread William Roche
On 9/10/24 13:36, David Hildenbrand wrote: On 10.09.24 12:02, “William Roche wrote: From: William Roche Hi, Apologies for the noise; resending as I missed CC'ing the maintainers of the changed files Hello, This is a Qemu RFC to introduce the possibility to deal with hardware memory

Re: [RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-10 Thread David Hildenbrand
On 10.09.24 12:02, “William Roche wrote: From: William Roche Hi, Apologies for the noise; resending as I missed CC'ing the maintainers of the changed files Hello, This is a Qemu RFC to introduce the possibility to deal with hardware memory errors impacting hugetlbfs memory backed VMs. W

[RFC RESEND 0/6] hugetlbfs largepage RAS project

2024-09-10 Thread “William Roche
From: William Roche Apologies for the noise; resending as I missed CC'ing the maintainers of the changed files Hello, This is a Qemu RFC to introduce the possibility to deal with hardware memory errors impacting hugetlbfs memory backed VMs. When using hugetlbfs large pages, any large page loc