On 9/11/2020 6:38 AM, Phil Yang wrote:
Using 'rte_mb' to synchronize the shared ring head/tail between producer and consumer stalls the pipeline and hurts performance on weak memory model platforms, such as aarch64.

Relaxing the expensive barrier to C11 atomics with explicit memory ordering can improve throughput by 3.6%.

Signed-off-by: Phil Yang <[email protected]>
Reviewed-by: Ruifeng Wang <[email protected]>

Hi Jakub,

Can you please check/test this patch?

Thanks,
ferruh
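
For anyone testing, here is a rough illustration of the kind of barrier relaxation the commit message describes. It is not the code from the patch: the names (demo_ring, demo_ring_enqueue, demo_ring_dequeue, DEMO_RING_SIZE) are hypothetical, it assumes a single producer and a single consumer, and it uses plain C11 <stdatomic.h>, which may differ from the primitives the patch actually uses. The idea is that each side publishes its index with a release store and reads the other side's index with an acquire load, instead of issuing a full barrier around every head/tail access.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical ring size for this sketch; must be a power of two. */
#define DEMO_RING_SIZE 8u
static_assert((DEMO_RING_SIZE & (DEMO_RING_SIZE - 1u)) == 0u,
              "ring size must be a power of two");

/* Hypothetical single-producer/single-consumer ring. */
struct demo_ring {
    _Atomic uint32_t head;          /* written only by the producer */
    _Atomic uint32_t tail;          /* written only by the consumer */
    uint32_t slots[DEMO_RING_SIZE];
};

/* Producer side: returns false when the ring is full. */
static bool demo_ring_enqueue(struct demo_ring *r, uint32_t value)
{
    /* Only the producer writes head, so a relaxed load is enough here. */
    uint32_t head = atomic_load_explicit(&r->head, memory_order_relaxed);
    /* Acquire pairs with the consumer's release store to tail, so the
     * consumer has finished reading a slot before it is overwritten. */
    uint32_t tail = atomic_load_explicit(&r->tail, memory_order_acquire);

    if (head - tail == DEMO_RING_SIZE)
        return false;

    r->slots[head & (DEMO_RING_SIZE - 1u)] = value;
    /* Release makes the slot write visible before the new head is seen. */
    atomic_store_explicit(&r->head, head + 1u, memory_order_release);
    return true;
}

/* Consumer side: returns false when the ring is empty. */
static bool demo_ring_dequeue(struct demo_ring *r, uint32_t *out)
{
    /* Only the consumer writes tail, so a relaxed load is enough here. */
    uint32_t tail = atomic_load_explicit(&r->tail, memory_order_relaxed);
    /* Acquire pairs with the producer's release store to head, so the
     * slot contents are visible before they are read. */
    uint32_t head = atomic_load_explicit(&r->head, memory_order_acquire);

    if (head == tail)
        return false;

    *out = r->slots[tail & (DEMO_RING_SIZE - 1u)];
    /* Release orders the slot read before the slot is handed back. */
    atomic_store_explicit(&r->tail, tail + 1u, memory_order_release);
    return true;
}
```

On aarch64, acquire loads and release stores typically compile to load-acquire/store-release instructions rather than a full barrier, which is where a gain like the one quoted above would be expected to come from. Whether that holds for a given setup still needs to be measured.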

