Re: [RFC v5 02/38] powerpc: Free up four 64K PTE bits in 64K backed HPTE pages
On Wed, Jul 12, 2017 at 01:10:51PM +1000, Balbir Singh wrote: > On Tue, 11 Jul 2017 08:44:15 -0700 > Ram Pai wrote: > > > On Tue, Jul 11, 2017 at 03:59:59PM +1000, Balbir Singh wrote: > > > On Wed, 5 Jul 2017 14:21:39 -0700 > > > Ram Pai wrote: > > > > > > > Rearrange 64K PTE bits to free up bits 3, 4, 5 and 6 > > > > in the 64K backed HPTE pages. This along with the earlier > > > > patch will entirely free up the four bits from 64K PTE. > > > > The bit numbers are big-endian as defined in the ISA3.0 > > > > > > > > This patch does the following change to 64K PTE backed > > > > by 64K HPTE. > > > > > > > > H_PAGE_F_SECOND (S) which occupied bit 4 moves to the > > > > second part of the pte to bit 60. > > > > H_PAGE_F_GIX (G,I,X) which occupied bit 5, 6 and 7 also > > > > moves to the second part of the pte to bit 61, > > > > 62, 63, 64 respectively > > > > > > > > since bit 7 is now freed up, we move H_PAGE_BUSY (B) from > > > > bit 9 to bit 7. > > > > > > > > The second part of the PTE will hold > > > > (H_PAGE_F_SECOND|H_PAGE_F_GIX) at bit 60,61,62,63. > > > > > > > > Before the patch, the 64K HPTE backed 64k PTE format was > > > > as follows > > > > > > > > 0 1 2 3 4 5 6 7 8 9 10...63 > > > > : : : : : : : : : : :: > > > > v v v v v v v v v v vv > > > > > > > > ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, > > > > |x|x|x| |S |G |I |X |x|B|x|x|x||.|.|.|.| <- primary pte > > > > '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' > > > > | | | | | | | | | | | | |..| | | | | <- secondary > > > > pte > > > > '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' > > > > > > > > > > It's not entirely clear what the secondary pte contains > > > today and how many of the bits are free today? > > > > The secondary pte today is not used for anything for 64k-hpte > > backed ptes. It gets used the moment the pte gets backed by > > 4-k hptes. Till then the bits are available. And this patch > > makes use of that knowledge. > > OK.. but does this mean subpage-protection? Or do you mean > page size demotion? I presume it's the later. Yes. the later. RP
Re: [RFC v5 02/38] powerpc: Free up four 64K PTE bits in 64K backed HPTE pages
On Tue, 11 Jul 2017 08:44:15 -0700 Ram Pai wrote: > On Tue, Jul 11, 2017 at 03:59:59PM +1000, Balbir Singh wrote: > > On Wed, 5 Jul 2017 14:21:39 -0700 > > Ram Pai wrote: > > > > > Rearrange 64K PTE bits to free up bits 3, 4, 5 and 6 > > > in the 64K backed HPTE pages. This along with the earlier > > > patch will entirely free up the four bits from 64K PTE. > > > The bit numbers are big-endian as defined in the ISA3.0 > > > > > > This patch does the following change to 64K PTE backed > > > by 64K HPTE. > > > > > > H_PAGE_F_SECOND (S) which occupied bit 4 moves to the > > > second part of the pte to bit 60. > > > H_PAGE_F_GIX (G,I,X) which occupied bit 5, 6 and 7 also > > > moves to the second part of the pte to bit 61, > > > 62, 63, 64 respectively > > > > > > since bit 7 is now freed up, we move H_PAGE_BUSY (B) from > > > bit 9 to bit 7. > > > > > > The second part of the PTE will hold > > > (H_PAGE_F_SECOND|H_PAGE_F_GIX) at bit 60,61,62,63. > > > > > > Before the patch, the 64K HPTE backed 64k PTE format was > > > as follows > > > > > > 0 1 2 3 4 5 6 7 8 9 10...63 > > > : : : : : : : : : : :: > > > v v v v v v v v v v vv > > > > > > ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, > > > |x|x|x| |S |G |I |X |x|B|x|x|x||.|.|.|.| <- primary pte > > > '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' > > > | | | | | | | | | | | | |..| | | | | <- secondary pte > > > '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' > > > > > > > It's not entirely clear what the secondary pte contains > > today and how many of the bits are free today? > > The secondary pte today is not used for anything for 64k-hpte > backed ptes. It gets used the moment the pte gets backed by > 4-k hptes. Till then the bits are available. And this patch > makes use of that knowledge. OK.. but does this mean subpage-protection? Or do you mean page size demotion? I presume it's the later. Balbir Singh.
Re: [RFC v5 02/38] powerpc: Free up four 64K PTE bits in 64K backed HPTE pages
On Tue, Jul 11, 2017 at 03:59:59PM +1000, Balbir Singh wrote: > On Wed, 5 Jul 2017 14:21:39 -0700 > Ram Pai wrote: > > > Rearrange 64K PTE bits to free up bits 3, 4, 5 and 6 > > in the 64K backed HPTE pages. This along with the earlier > > patch will entirely free up the four bits from 64K PTE. > > The bit numbers are big-endian as defined in the ISA3.0 > > > > This patch does the following change to 64K PTE backed > > by 64K HPTE. > > > > H_PAGE_F_SECOND (S) which occupied bit 4 moves to the > > second part of the pte to bit 60. > > H_PAGE_F_GIX (G,I,X) which occupied bit 5, 6 and 7 also > > moves to the second part of the pte to bit 61, > > 62, 63, 64 respectively > > > > since bit 7 is now freed up, we move H_PAGE_BUSY (B) from > > bit 9 to bit 7. > > > > The second part of the PTE will hold > > (H_PAGE_F_SECOND|H_PAGE_F_GIX) at bit 60,61,62,63. > > > > Before the patch, the 64K HPTE backed 64k PTE format was > > as follows > > > > 0 1 2 3 4 5 6 7 8 9 10...63 > > : : : : : : : : : : :: > > v v v v v v v v v v vv > > > > ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, > > |x|x|x| |S |G |I |X |x|B|x|x|x||.|.|.|.| <- primary pte > > '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' > > | | | | | | | | | | | | |..| | | | | <- secondary pte > > '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' > > > > It's not entirely clear what the secondary pte contains > today and how many of the bits are free today? The secondary pte today is not used for anything for 64k-hpte backed ptes. It gets used the moment the pte gets backed by 4-k hptes. Till then the bits are available. And this patch makes use of that knowledge. Will add some words in the patch description towards this. Thanks, RP > > Balbir Singh. -- Ram Pai
Re: [RFC v5 02/38] powerpc: Free up four 64K PTE bits in 64K backed HPTE pages
On Wed, 5 Jul 2017 14:21:39 -0700 Ram Pai wrote: > Rearrange 64K PTE bits to free up bits 3, 4, 5 and 6 > in the 64K backed HPTE pages. This along with the earlier > patch will entirely free up the four bits from 64K PTE. > The bit numbers are big-endian as defined in the ISA3.0 > > This patch does the following change to 64K PTE backed > by 64K HPTE. > > H_PAGE_F_SECOND (S) which occupied bit 4 moves to the > second part of the pte to bit 60. > H_PAGE_F_GIX (G,I,X) which occupied bit 5, 6 and 7 also > moves to the second part of the pte to bit 61, > 62, 63, 64 respectively > > since bit 7 is now freed up, we move H_PAGE_BUSY (B) from > bit 9 to bit 7. > > The second part of the PTE will hold > (H_PAGE_F_SECOND|H_PAGE_F_GIX) at bit 60,61,62,63. > > Before the patch, the 64K HPTE backed 64k PTE format was > as follows > > 0 1 2 3 4 5 6 7 8 9 10...63 > : : : : : : : : : : :: > v v v v v v v v v v vv > > ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, > |x|x|x| |S |G |I |X |x|B|x|x|x||.|.|.|.| <- primary pte > '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' > | | | | | | | | | | | | |..| | | | | <- secondary pte > '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' > It's not entirely clear what the secondary pte contains today and how many of the bits are free today? Balbir Singh.
[RFC v5 02/38] powerpc: Free up four 64K PTE bits in 64K backed HPTE pages
Rearrange 64K PTE bits to free up bits 3, 4, 5 and 6 in the 64K backed HPTE pages. This along with the earlier patch will entirely free up the four bits from 64K PTE. The bit numbers are big-endian as defined in the ISA3.0 This patch does the following change to 64K PTE backed by 64K HPTE. H_PAGE_F_SECOND (S) which occupied bit 4 moves to the second part of the pte to bit 60. H_PAGE_F_GIX (G,I,X) which occupied bit 5, 6 and 7 also moves to the second part of the pte to bit 61, 62, 63, 64 respectively since bit 7 is now freed up, we move H_PAGE_BUSY (B) from bit 9 to bit 7. The second part of the PTE will hold (H_PAGE_F_SECOND|H_PAGE_F_GIX) at bit 60,61,62,63. Before the patch, the 64K HPTE backed 64k PTE format was as follows 0 1 2 3 4 5 6 7 8 9 10...63 : : : : : : : : : : :: v v v v v v v v v v vv ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, |x|x|x| |S |G |I |X |x|B|x|x|x||.|.|.|.| <- primary pte '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' | | | | | | | | | | | | |..| | | | | <- secondary pte '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' After the patch, the 64k HPTE backed 64k PTE format is as follows 0 1 2 3 4 5 6 7 8 9 10...63 : : : : : : : : : : :: v v v v v v v v v v vv ,-,-,-,-,--,--,--,--,-,-,-,-,-,--,-,-,-, |x|x|x| | | | |B |x|x|x|x|x||.|.|.|.| <- primary pte '_'_'_'_'__'__'__'__'_'_'_'_'_''_'_'_'_' | | | | | | | | | | | | |..|S|G|I|X| <- secondary pte '_'_'_'_'__'__'__'__'_'_'_'_'__'_'_'_'_' The above PTE changes is applicable to hugetlbpages aswell. The patch does the following code changes: a) moves the H_PAGE_F_SECOND and H_PAGE_F_GIX to 4k PTE header since it is no more needed b the 64k PTEs. b) abstracts out __real_pte() and __rpte_to_hidx() so the caller need not know the bit location of the slot. c) moves the slot bits the secondary pte. Signed-off-by: Ram Pai --- arch/powerpc/include/asm/book3s/64/hash-4k.h |3 ++ arch/powerpc/include/asm/book3s/64/hash-64k.h | 29 ++- arch/powerpc/include/asm/book3s/64/hash.h |3 -- arch/powerpc/mm/hash64_64k.c | 30 ++-- arch/powerpc/mm/hugetlbpage-hash64.c | 22 ++ 5 files changed, 55 insertions(+), 32 deletions(-) diff --git a/arch/powerpc/include/asm/book3s/64/hash-4k.h b/arch/powerpc/include/asm/book3s/64/hash-4k.h index a306c0a..1e60099 100644 --- a/arch/powerpc/include/asm/book3s/64/hash-4k.h +++ b/arch/powerpc/include/asm/book3s/64/hash-4k.h @@ -16,6 +16,9 @@ #define H_PUD_TABLE_SIZE (sizeof(pud_t) << H_PUD_INDEX_SIZE) #define H_PGD_TABLE_SIZE (sizeof(pgd_t) << H_PGD_INDEX_SIZE) +#define H_PAGE_F_GIX_SHIFT 56 +#define H_PAGE_F_SECOND_RPAGE_RSV2 /* HPTE is in 2ndary HPTEG */ +#define H_PAGE_F_GIX (_RPAGE_RSV3 | _RPAGE_RSV4 | _RPAGE_RPN44) #define H_PAGE_BUSY_RPAGE_RSV1 /* software: PTE & hash are busy */ /* PTE flags to conserve for HPTE identification */ diff --git a/arch/powerpc/include/asm/book3s/64/hash-64k.h b/arch/powerpc/include/asm/book3s/64/hash-64k.h index 62e580c..c281f18 100644 --- a/arch/powerpc/include/asm/book3s/64/hash-64k.h +++ b/arch/powerpc/include/asm/book3s/64/hash-64k.h @@ -12,7 +12,7 @@ */ #define H_PAGE_COMBO _RPAGE_RPN0 /* this is a combo 4k page */ #define H_PAGE_4K_PFN _RPAGE_RPN1 /* PFN is for a single 4k page */ -#define H_PAGE_BUSY_RPAGE_RPN42 /* software: PTE & hash are busy */ +#define H_PAGE_BUSY_RPAGE_RPN44 /* software: PTE & hash are busy */ /* * We need to differentiate between explicit huge page and THP huge @@ -21,8 +21,7 @@ #define H_PAGE_THP_HUGE H_PAGE_4K_PFN /* PTE flags to conserve for HPTE identification */ -#define _PAGE_HPTEFLAGS (H_PAGE_BUSY | H_PAGE_F_SECOND | \ -H_PAGE_F_GIX | H_PAGE_HASHPTE | H_PAGE_COMBO) +#define _PAGE_HPTEFLAGS (H_PAGE_BUSY | H_PAGE_HASHPTE | H_PAGE_COMBO) /* * we support 16 fragments per PTE page of 64K size. */ @@ -50,24 +49,22 @@ static inline real_pte_t __real_pte(pte_t pte, pte_t *ptep) unsigned long *hidxp; rpte.pte = pte; - rpte.hidx = 0; - if (pte_val(pte) & H_PAGE_COMBO) { - /* -* Make sure we order the hidx load against the H_PAGE_COMBO -* check. The store side ordering is done in __hash_page_4K -*/ - smp_rmb(); - hidxp = (unsigned long *)(ptep + PTRS_PER_PTE); - rpte.hidx = *hidxp; - } + /* +* Ensure that we do not read the hidx before we read +* the pte. Because the writ