On Wed, Oct 10, 2018 at 04:08:40PM -0700, Alexander Duyck wrote:
> This change makes it so that we call the asynchronous probe routines on a
> CPU local to the device node. By doing this we should be able to improve
> our initialization time significantly as we can avoid having to access the
> device from a remote node which may introduce higher latency.

This is nice in theory, but what kind of real numbers does this show?
There's a lot of added complexity here, and what is the benefit?
Benchmarks or bootcharts that we can see would be great to have, thanks.

greg k-h