Re: [Qemu-devel] [PATCH 2/4] target/arm: simplify and optimize aarch64 rev16

2017-05-22 Thread Richard Henderson
On 05/16/2017 04:01 PM, Aurelien Jarno wrote: Instead of byteswapping individual 16-bit words one by one, work on the whole register at the same time using shifts and masks. This is the same strategy as the aarch32 version of rev16 and is much more efficient in the case sf=1. Signed-off-by:

[Qemu-devel] [PATCH 2/4] target/arm: simplify and optimize aarch64 rev16

2017-05-16 Thread Aurelien Jarno
Instead of byteswapping individual 16-bit words one by one, work on the whole register at the same time using shifts and masks. This is the same strategy as the aarch32 version of rev16 and is much more efficient in the case sf=1. Signed-off-by: Aurelien Jarno ---
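
For readers who want to see the shift-and-mask idea in isolation, here is a minimal sketch in plain C. It is not the code from the patch, which emits TCG ops inside the translator; the function names rev16_64 and rev16_32 are hypothetical and exist only for this illustration. The sketch assumes that sf=1 selects the 64-bit form of the instruction and sf=0 the 32-bit form.

```c
#include <stdint.h>

/*
 * Illustrative only: swap the two bytes inside every 16-bit halfword
 * of a 64-bit value in one pass (the sf=1 case).
 * The mask keeps the low byte of each halfword.
 */
static uint64_t rev16_64(uint64_t x)
{
    const uint64_t mask = 0x00ff00ff00ff00ffULL;

    /* High bytes move down by 8 bits, low bytes move up by 8 bits. */
    return ((x >> 8) & mask) | ((x & mask) << 8);
}

/* Same idea for a 32-bit value (the sf=0 case). */
static uint32_t rev16_32(uint32_t x)
{
    const uint32_t mask = 0x00ff00ffU;

    return ((x >> 8) & mask) | ((x & mask) << 8);
}
```

With this approach the operation count is fixed regardless of operand width: two shifts, two ANDs and one OR, instead of one extract/swap/insert sequence per halfword. That is why the gain is largest for the 64-bit case, which holds four halfwords.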