Re: [PATCH 7.2 v3 03/12] mm/huge_memory: remove READ_ONLY_THP_FOR_FS from file_thp_enabled()
On 27 Apr 2026, at 4:06, David Hildenbrand (Arm) wrote:
> On 4/25/26 14:00, Zi Yan wrote:
>> On 17 Apr 2026, at 22:44, Zi Yan wrote:
>>
>>> Replace it with a check on the max folio order of the file's address space
>>> mapping, making sure PMD THP is supported. Also remove the read-only fd
>>> check, since collapse_file() now makes sure all to-be-collapsed folios are
>>> clean and the created PMD file THP can be handled by FSes properly.
>>>
>>> Signed-off-by: Zi Yan
>>> ---
>>> mm/huge_memory.c | 8
>>> 1 file changed, 4 insertions(+), 4 deletions(-)
>>>
>>> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
>>> index 970e077019b7..7e9cf8c0985f 100644
>>> --- a/mm/huge_memory.c
>>> +++ b/mm/huge_memory.c
>>> @@ -86,9 +86,6 @@ static inline bool file_thp_enabled(struct vm_area_struct
>>> *vma)
>>> {
>>> struct inode *inode;
>>>
>>> - if (!IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS))
>>> - return false;
>>> -
>>> if (!vma->vm_file)
>>> return false;
>>>
>>> @@ -97,7 +94,10 @@ static inline bool file_thp_enabled(struct
>>> vm_area_struct *vma)
>>> if (IS_ANON_FILE(inode))
>>> return false;
>>>
>>> - return !inode_is_open_for_write(inode) && S_ISREG(inode->i_mode);
>>
>> Hi Matthew, David, and Lorenzo,
>>
>> After some discussions on irc, I feel that we probably should not allow
>> read-write fd for PMD THP collapse at the moment. Combining with the
>> filemap_flush() under folio_dirty() check from collapse_file(), khugepaged
>> would become a kwritebackd that scans pagecache folios and writes them back.
>> If we limit it to read-only fds, at least khugepaged would only write
>> back once for the pagecache folios from these fds.
>
> Makes sense. The comment above the filemap_flush() is valuable :)
>
>>
>> I am planning to restore inode_is_open_for_write() check in the next version.
>> Let me know your thoughts.
>
> That would be better. Do we plan on handling races with concurrent
> writable-opening?
>
Probably not worth the effort, READ_ONLY_THP_FOR_FS does it using
filemap_nr_thps()
and cost 8B in struct address_space. And read-only -> read->write race window
might
not be that a big issue, since it does not cause repeatedly write back.
> Or could we simply gate the filemap_flush() by a inode_is_open_for_write(), to
> not have it turn into a kwritebackd()?
This is a great idea. With it, not dirty folios from read-write fds can be
collapsed.
I will do it at the end.
Thanks.
--
Best Regards,
Yan, Zi
Re: [PATCH 7.2 v3 03/12] mm/huge_memory: remove READ_ONLY_THP_FOR_FS from file_thp_enabled()
On 4/25/26 14:00, Zi Yan wrote:
> On 17 Apr 2026, at 22:44, Zi Yan wrote:
>
>> Replace it with a check on the max folio order of the file's address space
>> mapping, making sure PMD THP is supported. Also remove the read-only fd
>> check, since collapse_file() now makes sure all to-be-collapsed folios are
>> clean and the created PMD file THP can be handled by FSes properly.
>>
>> Signed-off-by: Zi Yan
>> ---
>> mm/huge_memory.c | 8
>> 1 file changed, 4 insertions(+), 4 deletions(-)
>>
>> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
>> index 970e077019b7..7e9cf8c0985f 100644
>> --- a/mm/huge_memory.c
>> +++ b/mm/huge_memory.c
>> @@ -86,9 +86,6 @@ static inline bool file_thp_enabled(struct vm_area_struct
>> *vma)
>> {
>> struct inode *inode;
>>
>> -if (!IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS))
>> -return false;
>> -
>> if (!vma->vm_file)
>> return false;
>>
>> @@ -97,7 +94,10 @@ static inline bool file_thp_enabled(struct vm_area_struct
>> *vma)
>> if (IS_ANON_FILE(inode))
>> return false;
>>
>> -return !inode_is_open_for_write(inode) && S_ISREG(inode->i_mode);
>
> Hi Matthew, David, and Lorenzo,
>
> After some discussions on irc, I feel that we probably should not allow
> read-write fd for PMD THP collapse at the moment. Combining with the
> filemap_flush() under folio_dirty() check from collapse_file(), khugepaged
> would become a kwritebackd that scans pagecache folios and writes them back.
> If we limit it to read-only fds, at least khugepaged would only write
> back once for the pagecache folios from these fds.
Makes sense. The comment above the filemap_flush() is valuable :)
>
> I am planning to restore inode_is_open_for_write() check in the next version.
> Let me know your thoughts.
That would be better. Do we plan on handling races with concurrent
writable-opening?
Or could we simply gate the filemap_flush() by a inode_is_open_for_write(), to
not have it turn into a kwritebackd()?
--
Cheers,
David
Re: [PATCH 7.2 v3 03/12] mm/huge_memory: remove READ_ONLY_THP_FOR_FS from file_thp_enabled()
On 17 Apr 2026, at 22:44, Zi Yan wrote:
> Replace it with a check on the max folio order of the file's address space
> mapping, making sure PMD THP is supported. Also remove the read-only fd
> check, since collapse_file() now makes sure all to-be-collapsed folios are
> clean and the created PMD file THP can be handled by FSes properly.
>
> Signed-off-by: Zi Yan
> ---
> mm/huge_memory.c | 8
> 1 file changed, 4 insertions(+), 4 deletions(-)
>
> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
> index 970e077019b7..7e9cf8c0985f 100644
> --- a/mm/huge_memory.c
> +++ b/mm/huge_memory.c
> @@ -86,9 +86,6 @@ static inline bool file_thp_enabled(struct vm_area_struct
> *vma)
> {
> struct inode *inode;
>
> - if (!IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS))
> - return false;
> -
> if (!vma->vm_file)
> return false;
>
> @@ -97,7 +94,10 @@ static inline bool file_thp_enabled(struct vm_area_struct
> *vma)
> if (IS_ANON_FILE(inode))
> return false;
>
> - return !inode_is_open_for_write(inode) && S_ISREG(inode->i_mode);
Hi Matthew, David, and Lorenzo,
After some discussions on irc, I feel that we probably should not allow
read-write fd for PMD THP collapse at the moment. Combining with the
filemap_flush() under folio_dirty() check from collapse_file(), khugepaged
would become a kwritebackd that scans pagecache folios and writes them back.
If we limit it to read-only fds, at least khugepaged would only write
back once for the pagecache folios from these fds.
I am planning to restore inode_is_open_for_write() check in the next version.
Let me know your thoughts.
> + if (!mapping_pmd_thp_support(inode->i_mapping))
> + return false;
> +
> + return S_ISREG(inode->i_mode);
> }
>
> /* If returns true, we are unable to access the VMA's folios. */
> --
> 2.43.0
--
Best Regards,
Yan, Zi
Re: [PATCH 7.2 v3 03/12] mm/huge_memory: remove READ_ONLY_THP_FOR_FS from file_thp_enabled()
On 4/18/26 10:44 AM, Zi Yan wrote: Replace it with a check on the max folio order of the file's address space mapping, making sure PMD THP is supported. Also remove the read-only fd check, since collapse_file() now makes sure all to-be-collapsed folios are clean and the created PMD file THP can be handled by FSes properly. Signed-off-by: Zi Yan --- LGTM. Reviewed-by: Baolin Wang
[PATCH 7.2 v3 03/12] mm/huge_memory: remove READ_ONLY_THP_FOR_FS from file_thp_enabled()
Replace it with a check on the max folio order of the file's address space
mapping, making sure PMD THP is supported. Also remove the read-only fd
check, since collapse_file() now makes sure all to-be-collapsed folios are
clean and the created PMD file THP can be handled by FSes properly.
Signed-off-by: Zi Yan
---
mm/huge_memory.c | 8
1 file changed, 4 insertions(+), 4 deletions(-)
diff --git a/mm/huge_memory.c b/mm/huge_memory.c
index 970e077019b7..7e9cf8c0985f 100644
--- a/mm/huge_memory.c
+++ b/mm/huge_memory.c
@@ -86,9 +86,6 @@ static inline bool file_thp_enabled(struct vm_area_struct
*vma)
{
struct inode *inode;
- if (!IS_ENABLED(CONFIG_READ_ONLY_THP_FOR_FS))
- return false;
-
if (!vma->vm_file)
return false;
@@ -97,7 +94,10 @@ static inline bool file_thp_enabled(struct vm_area_struct
*vma)
if (IS_ANON_FILE(inode))
return false;
- return !inode_is_open_for_write(inode) && S_ISREG(inode->i_mode);
+ if (!mapping_pmd_thp_support(inode->i_mapping))
+ return false;
+
+ return S_ISREG(inode->i_mode);
}
/* If returns true, we are unable to access the VMA's folios. */
--
2.43.0

