On Mon, Jun 08, 2026 at 04:39:18AM -0400, Michael S. Tsirkin wrote:
> Convert vma_alloc_anon_folio_pmd() to pass __GFP_ZERO instead of
> zeroing at the callsite. post_alloc_hook uses the fault address
> passed through vma_alloc_folio for cache-friendly zeroing.
>
> Note: before this series, replacing folio_zero_user() with
> __GFP_ZERO was unsafe on cache-aliasing architectures because
> __GFP_ZERO uses clear_page() without a dcache flush. With this
> series, it is safe if the caller passes a valid user address
> (not USER_ADDR_NONE) to vma_alloc_folio() etc., which delivers
> it to post_alloc_hook() for the dcache flush via
> folio_zero_user(). It is only unsafe if USER_ADDR_NONE is passed.
>
> Note: with __GFP_ZERO, the folio is zeroed before
> mem_cgroup_charge().  If the charge fails, the zeroing work is
> wasted.  Previously zeroing was done after a successful charge.
> This is inherent to moving zeroing into the allocator.
> Charge failures are rare (only at cgroup limits).
>
> Use folio_put_zeroed() on charge failure so the zeroed hint
> propagates to the buddy allocator, avoiding redundant re-zeroing
> on the next allocation attempt.

Again, is this worth it?...

Every bit of code added increases risks of bugs, maintenance burden,
etc. let's just not do stuff because we can.

>
> Signed-off-by: Michael S. Tsirkin <[email protected]>
> Reviewed-by: Gregory Price <[email protected]>
> Assisted-by: Claude:claude-opus-4-6
> ---
>  mm/huge_memory.c | 14 +++-----------
>  1 file changed, 3 insertions(+), 11 deletions(-)
>
> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
> index d689e6491ddb..0dec3c717ff2 100644
> --- a/mm/huge_memory.c
> +++ b/mm/huge_memory.c
> @@ -1333,7 +1333,7 @@ EXPORT_SYMBOL_GPL(thp_get_unmapped_area);
>  static struct folio *vma_alloc_anon_folio_pmd(struct vm_area_struct *vma,
>               unsigned long addr)
>  {
> -     gfp_t gfp = vma_thp_gfp_mask(vma);
> +     gfp_t gfp = vma_thp_gfp_mask(vma) | __GFP_ZERO;
>       const int order = HPAGE_PMD_ORDER;
>       struct folio *folio;
>
> @@ -1347,7 +1347,7 @@ static struct folio *vma_alloc_anon_folio_pmd(struct 
> vm_area_struct *vma,
>
>       VM_BUG_ON_FOLIO(!folio_test_large(folio), folio);
>       if (mem_cgroup_charge(folio, vma->vm_mm, gfp)) {
> -             folio_put(folio);
> +             folio_put_zeroed(folio);

Same comments as previously.

>               count_vm_event(THP_FAULT_FALLBACK);
>               count_vm_event(THP_FAULT_FALLBACK_CHARGE);
>               count_mthp_stat(order, MTHP_STAT_ANON_FAULT_FALLBACK);
> @@ -1356,17 +1356,9 @@ static struct folio *vma_alloc_anon_folio_pmd(struct 
> vm_area_struct *vma,
>       }
>       folio_throttle_swaprate(folio, gfp);
>
> -       /*
> -     * When a folio is not zeroed during allocation (__GFP_ZERO not used)
> -     * or user folios require special handling, folio_zero_user() is used to
> -     * make sure that the page corresponding to the faulting address will be
> -     * hot in the cache after zeroing.
> -     */
> -     if (user_alloc_needs_zeroing())
> -             folio_zero_user(folio, addr);
>       /*
>        * The memory barrier inside __folio_mark_uptodate makes sure that
> -      * folio_zero_user writes become visible before the set_pmd_at()
> +      * page zeroing becomes visible before the set_pmd_at()

folio zeroing?

>        * write.
>        */
>       __folio_mark_uptodate(folio);
> --
> MST
>

Thanks, Lorenzo

Reply via email to