On 2/25/20 2:52 AM, Yanqin Wei wrote:
> Read/write concurrency of pvector library is implemented by a temp vector
> and RCU protection. Considering performance reason, insertion does not
> follow this scheme.
> In insertion function, a thread fence ensures size incrementation is done
> after new entry is stored. But there is no barrier in the iteration
> function (pvector_cursor_init). Entry point access may be reordered before
> loading vector size, so the invalid entry point may be loaded when vector
> iteration.
> This patch fixes it by acquire-release pair. It can guarantee new size is
> observed by reader after new entry stored by writer. And this is
> implemented by one-way barrier instead of two-way memory fence.
> 
> Reviewed-by: Gavin Hu <gavin...@arm.com>
> Reviewed-by: Lijian Zhang <lijian.zh...@arm.com>
> Signed-off-by: Yanqin Wei <yanqin....@arm.com>
> ---
>  lib/pvector.c | 14 +++++++-------
>  lib/pvector.h | 12 +++++++-----
>  2 files changed, 14 insertions(+), 12 deletions(-)
> 
> diff --git a/lib/pvector.c b/lib/pvector.c
> index aaeee9214..12c599c97 100644
> --- a/lib/pvector.c
> +++ b/lib/pvector.c
> @@ -33,7 +33,7 @@ pvector_impl_alloc(size_t size)
>      struct pvector_impl *impl;
>  
>      impl = xmalloc(sizeof *impl + size * sizeof impl->vector[0]);
> -    impl->size = 0;
> +    atomic_init(&impl->size, 0);
>      impl->allocated = size;
>  
>      return impl;
> @@ -117,18 +117,18 @@ pvector_insert(struct pvector *pvec, void *ptr, int priority)
>  {
>      struct pvector_impl *temp = pvec->temp;
>      struct pvector_impl *old = pvector_impl_get(pvec);
> +    size_t size = old->size;

Why is this not an atomic read?  I understand that insertions are not
thread-safe and must either be protected by a mutex or always be executed from
the same thread.  However, if we choose to read this variable non-atomically,
we could avoid introducing an additional variable here and, at the same time,
avoid modifying most of the lines in this function.  A comment explaining why
we read it non-atomically would be good anyway, since we should be consistent
and use atomic operations for variables marked as atomic wherever possible.
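
To illustrate the alternative, here is a minimal sketch in plain C11
<stdatomic.h>, not the OVS atomic wrappers.  The names 'struct vec',
'vec_append' and 'vec_snapshot_size' are hypothetical stand-ins and the
fixed CAPACITY is an assumption made only to keep the example short.  The
single writer reads the size with a relaxed atomic load and publishes the
new entry with a release store; the reader pairs that with an acquire load.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for the pvector structures, not the OVS ones. */
struct entry {
    int priority;
    void *ptr;
};

#define CAPACITY 8   /* Assumed fixed capacity, for illustration only. */

struct vec {
    atomic_size_t size;             /* Number of published entries. */
    size_t allocated;               /* Number of allocated entries. */
    struct entry vector[CAPACITY];
};

static void
vec_init(struct vec *v)
{
    atomic_init(&v->size, 0);
    v->allocated = CAPACITY;
}

/* Writer side.  Writers are assumed to be serialized externally, so no
 * other thread modifies 'size' and a relaxed load is sufficient here.
 * Returns false when the entry cannot simply be appended. */
static bool
vec_append(struct vec *v, void *ptr, int priority)
{
    size_t size = atomic_load_explicit(&v->size, memory_order_relaxed);

    assert(ptr != NULL);

    if (v->allocated > size
        && (!size || priority <= v->vector[size - 1].priority)) {
        v->vector[size].ptr = ptr;
        v->vector[size].priority = priority;
        /* Release store: the entry written above becomes visible to a
         * reader no later than the incremented size. */
        atomic_store_explicit(&v->size, size + 1, memory_order_release);
        return true;
    }
    return false;
}

/* Reader side.  The acquire load pairs with the release store above, so
 * every entry below the returned size is fully initialized. */
static size_t
vec_snapshot_size(struct vec *v)
{
    return atomic_load_explicit(&v->size, memory_order_acquire);
}
```

With this shape the writer still needs a local 'size', but every access to
the atomic variable is explicit about its ordering, which is the kind of
consistency asked for above.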

>  
>      ovs_assert(ptr != NULL);
>  
>      /* Check if can add to the end without reallocation. */
> -    if (!temp && old->allocated > old->size &&
> -        (!old->size || priority <= old->vector[old->size - 1].priority)) {
> -        old->vector[old->size].ptr = ptr;
> -        old->vector[old->size].priority = priority;
> +    if (!temp && old->allocated > size &&
> +        (!size || priority <= old->vector[size - 1].priority)) {
> +        old->vector[size].ptr = ptr;
> +        old->vector[size].priority = priority;
>          /* Size increment must not be visible to the readers before the new
>           * entry is stored. */
> -        atomic_thread_fence(memory_order_release);
> -        ++old->size;
> +        atomic_store_explicit(&old->size, size + 1, memory_order_release);
>      } else {
>          if (!temp) {
>              temp = pvector_impl_dup(old);
> diff --git a/lib/pvector.h b/lib/pvector.h
> index b990ed9d5..430bdf746 100644
> --- a/lib/pvector.h
> +++ b/lib/pvector.h
> @@ -69,8 +69,8 @@ struct pvector_entry {
>  };
>  
>  struct pvector_impl {
> -    size_t size;       /* Number of entries in the vector. */
> -    size_t allocated;  /* Number of allocated entries. */
> +    ATOMIC(size_t) size;  /* Number of entries in the vector. */

Use 'atomic_size_t' here instead of 'ATOMIC(size_t)'.

> +    size_t allocated;     /* Number of allocated entries. */
>      struct pvector_entry vector[];
>  };
>  
> @@ -172,7 +172,7 @@ static inline void pvector_cursor_lookahead(const struct pvector_cursor *,
>  #define PVECTOR_CURSOR_FOR_EACH_CONTINUE(PTR, CURSOR)                   \
>      for (; ((PTR) = pvector_cursor_next(CURSOR, INT_MIN, 0, 0)) != NULL; )
>  
> -

Don't remove the form feed character.

> +
>  /* Inline implementations. */
>  
>  static inline struct pvector_cursor
> @@ -181,12 +181,14 @@ pvector_cursor_init(const struct pvector *pvec,
>  {
>      const struct pvector_impl *impl;
>      struct pvector_cursor cursor;
> +    size_t size;
>  
>      impl = ovsrcu_get(struct pvector_impl *, &pvec->impl);
>  
> -    ovs_prefetch_range(impl->vector, impl->size * sizeof impl->vector[0]);
> +    atomic_read_explicit(&impl->size, &size, memory_order_acquire);
> +    ovs_prefetch_range(impl->vector, size * sizeof impl->vector[0]);
>  
> -    cursor.size = impl->size;
> +    cursor.size = size;
>      cursor.vector = impl->vector;
>      cursor.entry_idx = -1;
>  
> 
