>+movu m1, [r2] >+punpcklbw m2, m1, m0 Here have a hide register copy, try to avoid it by SSE4.1 "pmovzxbw m2, m1" >+movu [r0], m2 >+punpckhbw m1, m0 >+movu [r0 + 16], m1
_______________________________________________ x265-devel mailing list [email protected] https://mailman.videolan.org/listinfo/x265-devel
