On Tue, Aug 16, 2016 at 11:32 PM, Richard Henderson <r...@twiddle.net> wrote: > On 08/16/2016 05:02 AM, vijay.kil...@gmail.com wrote: >> >> +static inline void prefetch_vector_loop(const VECTYPE *p, int index) >> +{ >> +#if defined(__aarch64__) >> + if (is_thunderx_pass2_cpu()) { >> + /* Prefetch 4 cache lines ahead from index */ >> + VEC_PREFETCH(p, index + (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR >> * 4)); >> + } >> +#endif >> +} > > > Oh come now. This is even worse than before. A function call protecting a > mere prefetch within the main body of an inner loop? > > Did you not understand what I was asking for?
No, Could you please detail the problem?. > > > r~