- Add the scattered Rx burst function for 32B legacy descriptor - Add the NEON-optimised Tx burst function - Add the NEON-optimised Rx burst functions for flexible descriptor
Single-core benchmark result on a Grace server with an Intel E810: | buffer sz (B) | prev (MPPS) | optimised (MPPS) | uplift | | ------------- | ----------- | ---------------- | ------ | | 64 | 62.241 | 86.010 | 38.2% | | 128 | 60.813 | 81.453 | 33.9% | | 256 | 23.730 | 24.633 | 3.8% | | 512 | 10.268 | 10.380 | 1.1% | | 1024 | 4.588 | 4.628 | 0.9% | | 1518 | 4.601 | 4.669 | 1.5% | --- v3: - Optimised the branch prediction in counting packets by changing unlikely() to likely() - Added the single-core performance data from Grace + E810 done by Andrew v2: - Fixed the AArch32 cross-building issue on AArch64 by restricting the call of iavf_rxq_vec_setup to AArch64 only Jay Wang (3): net/iavf: add Rx scattered function for 32B desc net/iavf: add NEON-optimised Tx burst function net/iavf: add NEON support for Rx flex desc drivers/net/intel/iavf/iavf.h | 4 + drivers/net/intel/iavf/iavf_rxtx.c | 54 +- drivers/net/intel/iavf/iavf_rxtx.h | 2 - drivers/net/intel/iavf/iavf_rxtx_vec_neon.c | 753 +++++++++++++++++++- drivers/net/intel/iavf/meson.build | 2 +- 5 files changed, 797 insertions(+), 18 deletions(-) -- 2.43.0

