On Thu, 21 Dec 2017, kemi wrote:

> Some thinking about that:
> a) the overhead due to cache bouncing caused by NUMA counter update in fast 
> path
> severely increase with more and more CPUs cores
> b) AFAIK, the typical usage scenario (similar at least)for which this 
> optimization can
> benefit is 10/40G NIC used in high-speed data center network of cloud service 
> providers.

I think you are fighting a lost battle there. As evident from the timing
constraints on packet processing in a 10/40G you will have a hard time to
process data if the packets are of regular ethernet size. And we alrady
have 100G NICs in operation here.

We can try to get the performance as high as possible but full rate high
speed networking invariable must use offload mechanisms and thus the
statistics would only be available from the hardware devices that can do
wire speed processing.

Reply via email to