On Wed, Jul 29, 2020 at 3:14 PM Dumitrescu, Cristian
<cristian.dumitre...@intel.com> wrote:
> > Please correct me if I am wrong, but it simply means this part of the
> > table library never worked for 32-bit.
> > It seems more adding 32-bit support rather than a fix and then I
> > wonder if it has its place in rc3.
> >
>
> Functionally. the code works, but performance is affected.
>
> The only thing that prevents the code from working is the check in the table 
> create function that checks the size of the above structure is 64 bytes, 
> which caught this issue.

Yes, and that's my point.
It was not working.
It was not tested.


This patch asks for backport in stable branches, I will let Kevin and
Luca comment.


>
> >
> >
> > Now, looking at the details:
> >
> > For 64-bit on my x86, we have:
> >
> > struct rte_bucket_4_8 {
> >     uint64_t                   signature;            /*     0     8 */
> >     uint64_t                   lru_list;             /*     8     8 */
> >     struct rte_bucket_4_8 *    next;                 /*    16     8 */
> >     uint64_t                   next_valid;           /*    24     8 */
> >     uint64_t                   key[4];               /*    32    32 */
> >     /* --- cacheline 1 boundary (64 bytes) --- */
> >     uint8_t                    data[];               /*    64     0 */
> >
> >     /* size: 64, cachelines: 1, members: 6 */
> > };
> >
> >
> > For 32-bit, we have:
> >
> > struct rte_bucket_4_8 {
> >     uint64_t                   signature;            /*     0     8 */
> >     uint64_t                   lru_list;             /*     8     8 */
> >     struct rte_bucket_4_8 *    next;                 /*    16     4 */
> >     uint64_t                   next_valid;           /*    20     8 */
> >     uint64_t                   key[4];               /*    28    32 */
> >     uint8_t                    data[];               /*    60     0 */
> >
> >     /* size: 60, cachelines: 1, members: 6 */
> >     /* last cacheline: 60 bytes */
> > } __attribute__((__packed__));
> >
> > ^^ it is interesting that a packed attribute ends up here.
> > I saw no such attribute in the library code.
> > Compiler black magic at work I guess...
> >
>
> Where do you see the packet attribute? I don't see it in the code.

That's pahole reporting this.
Maybe the tool extrapolates this attribute based on the next_valid
field placement... I don't know.

> A packet attribute would explain this issue, i.e. why did the compiler decide 
> not to insert an expected padfing of 4 bytes right after the "next" field, 
> that would allow the field "next_valid" to be aligned to its natural boundary 
> of 8 bytes.

Or a 64-bit field on 32-bit has a special alignment that I am not aware of.


>
> >
> > >
> > > Fixes: 8aa327214c ("table: hash")
> > > Cc: sta...@dpdk.org
> > >
> > > Signed-off-by: Ting Xu <ting...@intel.com>
> > >
> > > ---
> > > v3->v4: Change design based on comment
> > > v2->v3: Rebase
> > > v1->v2: Correct patch time
> > > ---
> > >  lib/librte_table/rte_table_hash_key16.c | 17 +++++++++++++++++
> > >  lib/librte_table/rte_table_hash_key32.c | 17 +++++++++++++++++
> > >  lib/librte_table/rte_table_hash_key8.c  | 16 ++++++++++++++++
> > >  3 files changed, 50 insertions(+)
> > >
> > > diff --git a/lib/librte_table/rte_table_hash_key16.c
> > b/lib/librte_table/rte_table_hash_key16.c
> > > index 2cca1c924..c4384b114 100644
> > > --- a/lib/librte_table/rte_table_hash_key16.c
> > > +++ b/lib/librte_table/rte_table_hash_key16.c
> > > @@ -33,6 +33,7 @@
> > >
> > >  #endif
> > >
> > > +#ifdef RTE_ARCH_64
> > >  struct rte_bucket_4_16 {
> > >         /* Cache line 0 */
> > >         uint64_t signature[4 + 1];
> > > @@ -46,6 +47,22 @@ struct rte_bucket_4_16 {
> > >         /* Cache line 2 */
> > >         uint8_t data[0];
> > >  };
> > > +#else
> > > +struct rte_bucket_4_16 {
> > > +       /* Cache line 0 */
> > > +       uint64_t signature[4 + 1];
> > > +       uint64_t lru_list;
> > > +       struct rte_bucket_4_16 *next;
> > > +       uint32_t pad;
> > > +       uint64_t next_valid;
> > > +
> > > +       /* Cache line 1 */
> > > +       uint64_t key[4][2];
> > > +
> > > +       /* Cache line 2 */
> > > +       uint8_t data[0];
> > > +};
> > > +#endif
> >
> > The change could simply be:
> >
> > @@ -38,6 +38,9 @@ struct rte_bucket_4_16 {
> >         uint64_t signature[4 + 1];
> >         uint64_t lru_list;
> >         struct rte_bucket_4_16 *next;
> > +#ifndef RTE_ARCH_64
> > +       uint32_t pad;
> > +#endif
> >         uint64_t next_valid;
> >
> >         /* Cache line 1 */
> >
> > It avoids duplicating the whole structure definition (we could miss
> > updating one side of the #ifdef later).
> > Idem for the other "8" and "32" structures.


What about this comment?


-- 
David Marchand

Reply via email to