On 08/16/2016 05:02 AM, vijay.kil...@gmail.com wrote:
+static inline void prefetch_vector_loop(const VECTYPE *p, int index)
+{
+#if defined(__aarch64__)
+    if (is_thunderx_pass2_cpu()) {
+        /* Prefetch 4 cache lines ahead from index */
+        VEC_PREFETCH(p, index + (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR * 
4));
+    }
+#endif
+}

Oh come now. This is even worse than before. A function call protecting a mere prefetch within the main body of an inner loop?

Did you not understand what I was asking for?


r~

Reply via email to