18/04/2023 14:55, Bruce Richardson: > On Tue, Apr 18, 2023 at 01:29:49PM +0200, Morten Brørup wrote: > > From: Bruce Richardson [mailto:bruce.richard...@intel.com] > > > On Tue, Apr 11, 2023 at 08:48:45AM +0200, Morten Brørup wrote: > > > > + if (__extension__(__builtin_constant_p(n)) && n <= cache->len) { > > > > + /* > > > > + * The request size is known at build time, and > > > > + * the entire request can be satisfied from the cache, > > > > + * so let the compiler unroll the fixed length copy > > > > loop. > > > > + */ > > > > + cache->len -= n; > > > > + for (index = 0; index < n; index++) > > > > + *obj_table++ = *--cache_objs; > > > > + > > > > > > This loop looks a little awkward to me. Would it be clearer (and perhaps > > > easier for compilers to unroll efficiently if it was rewritten as: > > > > > > cache->len -= n; > > > cache_objs = &cache->objs[cache->len]; > > > for (index = 0; index < n; index++) > > > obj_table[index] = cache_objs[index]; > > > > The mempool cache is a stack, so the copy loop needs get the objects in > > decrementing order. I.e. the source index decrements and the destination > > index increments. > > > > BTW: Please add this as a comment in the code too, above the loop to avoid > future developers (or even future me), asking this question again!
Looks like this request was missed.