On Sat, Mar 25, 2017 at 6:54 PM, Tom Herbert <t...@herbertland.com> wrote: > On Fri, Mar 24, 2017 at 2:52 PM, Saeed Mahameed <sae...@mellanox.com> wrote: >> Hi Dave, >> >> This series provides some preformancee optimizations for mlx5e >> driver, especially for XDP TX flows. >> >> 1st patch is a simple change of rmb to dma_rmb in CQE fetch routine >> which shows a huge gain for both RX and TX packet rates. >> >> 2nd patch removes write combining logic from the driver TX handler >> and simplifies the TX logic while improving TX CPU utilization. >> >> All other patches combined provide some refactoring to the driver TX >> flows to allow some significant XDP TX improvements. >> >> More details and performance numbers per patch can be found in each patch >> commit message compared to the preceding patch. >> >> Overall performance improvemnets >> System: Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz >> >> Test case Baseline Now improvement >> --------------------------------------------------------------- >> TX packets (24 threads) 45Mpps 54Mpps 20% >> TC stack Drop (1 core) 3.45Mpps 3.6Mpps 5% >> XDP Drop (1 core) 14Mpps 16.9Mpps 20% >> XDP TX (1 core) 10.4Mpps 13.7Mpps 31% >> > Awesome, and good timing. I'll be presenting XDP at IETF next and > would like to include these numbers in the presentation if you don't > mind... >
Not at all, please go ahead. But as you see, the system i tested on is not that powerful. We can get even better results with a modern system. If you want i can provide you those numbers by mid-week.