>-----Original Message-----
>From: Honnappa Nagarahalli [mailto:honnappa.nagaraha...@arm.com]
>Sent: Tuesday, June 25, 2019 2:15 PM
>To: Wang, Yipeng1 <yipeng1.w...@intel.com>; Gobriel, Sameh 
><sameh.gobr...@intel.com>; Richardson, Bruce
><bruce.richard...@intel.com>; De Lara Guarch, Pablo 
><pablo.de.lara.gua...@intel.com>; honnappa.nagaraha...@arm.com
>Cc: gavin...@arm.com; ruifeng.w...@arm.com; dev@dpdk.org; n...@arm.com; 
>sta...@dpdk.org
>Subject: [PATCH 2/3] lib/hash: load pData after full key compare
>
>When a hash entry is added, there are 2 sets of stores.
>
>1) The application writes its data to memory (whose address
>is provided in rte_hash_add_key_with_hash_data API (or NULL))
>2) The rte_hash library writes to its own internal data structures;
>key store entry and the hash table.
>
>The only ordering requirement between these 2 is that - the store
>to the application data must complete before the store to key_index.
>There are no ordering requirements between the stores to the
>key/signature and store to application data. The synchronization
>point for application data can be any point between the 'store to
>application data' and 'store to the key_index'. So, pData should not
>be a guard variable for the data in hash table. It should be a guard
>variable only for the application data written to the memory location
>pointed by pData. Hence, pData can be loaded after full key comparison.
>
>Fixes: e605a1d36 ("hash: add lock-free r/w concurrency")
>Cc: sta...@dpdk.org
>
>Signed-off-by: Honnappa Nagarahalli <honnappa.nagaraha...@arm.com>
>Reviewed-by: Gavin Hu <gavin...@arm.com>
>Tested-by: Ruifeng Wang <ruifeng.w...@arm.com>
>---
> lib/librte_hash/rte_cuckoo_hash.c | 67 +++++++++++++++----------------
> 1 file changed, 32 insertions(+), 35 deletions(-)
>
>diff --git a/lib/librte_hash/rte_cuckoo_hash.c 
>b/lib/librte_hash/rte_cuckoo_hash.c
>index f37f6957d..077328fed 100644
>--- a/lib/librte_hash/rte_cuckoo_hash.c
>+++ b/lib/librte_hash/rte_cuckoo_hash.c
>@@ -649,9 +649,11 @@ search_and_update(const struct rte_hash *h, void *data, 
>const void *key,
>                       k = (struct rte_hash_key *) ((char *)keys +
>                                       bkt->key_idx[i] * h->key_entry_size);
>                       if (rte_hash_cmp_eq(key, k->key, h) == 0) {
>-                              /* 'pdata' acts as the synchronization point
>-                               * when an existing hash entry is updated.
>-                               * Key is not updated in this case.
>+                              /* The store to application data at *data
>+                               * should not leak after the store to pdata
>+                               * in the key store. i.e. pdata is the guard
>+                               * variable. Release the application data
>+                               * to the readers.
>                                */
>                               __atomic_store_n(&k->pdata,
>                                       data,
>@@ -711,11 +713,10 @@ rte_hash_cuckoo_insert_mw(const struct rte_hash *h,
>               /* Check if slot is available */
>               if (likely(prim_bkt->key_idx[i] == EMPTY_SLOT)) {
>                       prim_bkt->sig_current[i] = sig;
>-                      /* Key can be of arbitrary length, so it is
>-                       * not possible to store it atomically.
>-                       * Hence the new key element's memory stores
>-                       * (key as well as data) should be complete
>-                       * before it is referenced.
>+                      /* Store to signature and key should not
>+                       * leak after the store to key_idx. i.e.
>+                       * key_idx is the guard variable for signature
>+                       * and key.
>                        */
>                       __atomic_store_n(&prim_bkt->key_idx[i],
>                                        new_idx,
>@@ -990,17 +991,15 @@ __rte_hash_add_key_with_hash(const struct rte_hash *h, 
>const void *key,
>
>       new_k = RTE_PTR_ADD(keys, (uintptr_t)slot_id * h->key_entry_size);
>       new_idx = (uint32_t)((uintptr_t) slot_id);
>-      /* Copy key */
>-      memcpy(new_k->key, key, h->key_len);
>-      /* Key can be of arbitrary length, so it is not possible to store
>-       * it atomically. Hence the new key element's memory stores
>-       * (key as well as data) should be complete before it is referenced.
>-       * 'pdata' acts as the synchronization point when an existing hash
>-       * entry is updated.
>+      /* The store to application data (by the application) at *data should
>+       * not leak after the store of pdata in the key store. i.e. pdata is
>+       * the guard variable. Release the application data to the readers.
>        */
>       __atomic_store_n(&new_k->pdata,
>               data,
>               __ATOMIC_RELEASE);
[Wang, Yipeng] Actually do we need to guard pdata for the newly inserted key?  
I thought the guard of key_idx already can make sure
The order for the application to read data. 
>+      /* Copy key */
>+      memcpy(new_k->key, key, h->key_len);
[Wang, Yipeng] You don't need to do the order change just to show the point of 
unnecessary ordering I think.
I am afraid it may cause further confusion for future people who read this 
change, especially it is not in the commit
Message (and it is a bug fix). 
>

Reply via email to