On 08/16/2016 05:02 AM, vijay.kil...@gmail.com wrote:
+static inline void prefetch_vector_loop(const VECTYPE *p, int index)
+{
+#if defined(__aarch64__)
+    if (is_thunderx_pass2_cpu()) {
+        /* Prefetch 4 cache lines ahead from index */
+        VEC_PREFETCH(p, index + (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR * 4));
+    }
+#endif
+}

Oh come now. This is even worse than before. A function call protecting a mere prefetch within the main body of an inner loop?
Did you not understand what I was asking for?

r~
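
To illustrate the objection to a per-iteration check, here is a minimal sketch of one possible alternative: test the CPU once, outside the loop, and then run a loop variant that either prefetches unconditionally or not at all. This is only an assumption about what a cheaper structure could look like, not the change that was requested earlier in the thread. Every name below (cpu_wants_prefetch, all_zero_plain, all_zero_prefetch, buffer_all_zero, PREFETCH_DISTANCE) is hypothetical, and __builtin_prefetch is the GCC/Clang builtin rather than the patch's VEC_PREFETCH macro.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical prefetch distance, in elements (not taken from the patch). */
#define PREFETCH_DISTANCE 8

/*
 * Hypothetical stand-in for the patch's CPU test. A real check would
 * identify the CPU model; this placeholder only looks at the architecture.
 */
static bool cpu_wants_prefetch(void)
{
#if defined(__aarch64__)
    return true;
#else
    return false;
#endif
}

/* Loop variant with no prefetch and no CPU check. */
static bool all_zero_plain(const uint64_t *p, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* Loop variant that prefetches unconditionally; still no CPU check inside. */
static bool all_zero_prefetch(const uint64_t *p, size_t n)
{
    size_t i = 0;
    /* Stop prefetching early so the prefetched address stays in bounds. */
    size_t limit = n > PREFETCH_DISTANCE ? n - PREFETCH_DISTANCE : 0;

    for (; i < limit; i++) {
        __builtin_prefetch(&p[i + PREFETCH_DISTANCE]);
        if (p[i]) {
            return false;
        }
    }
    /* Tail: nothing left worth prefetching. */
    for (; i < n; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* The CPU test runs once per call, never once per loop iteration. */
static bool buffer_all_zero(const uint64_t *p, size_t n)
{
    return cpu_wants_prefetch() ? all_zero_prefetch(p, n)
                                : all_zero_plain(p, n);
}
```

With this shape the inner loops contain only the load, the compare and, in one variant, the prefetch; the cost of the CPU test is paid once per buffer.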