On 05/16/2017 04:01 PM, Aurelien Jarno wrote:
Instead of byteswapping individual 16-bit words one by one, work on the
whole register at the same time using shifts and masks. This is the same
strategy as the aarch32 version of rev16 and is much more efficient
in the case sf=1.

Signed-off-by: Aurelien Jarno <aurel...@aurel32.net>
---
  target/arm/translate-a64.c | 24 ++++++------------------
  1 file changed, 6 insertions(+), 18 deletions(-)
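
For readers without the diff at hand, the shift-and-mask strategy described
in the commit message can be sketched in plain C roughly as follows. This is
only an illustration of the idea on host integers, not the TCG code from the
patch; the helper names rev16_64 and rev16_32 are hypothetical and do not
exist in QEMU.

```c
#include <stdint.h>

/*
 * Hypothetical helper (not QEMU code): swap the two bytes inside each
 * 16-bit halfword of a 64-bit value, i.e. the effect of REV16 with sf=1.
 * All four halfwords are handled at once with two shifts and two masks.
 */
static uint64_t rev16_64(uint64_t x)
{
    const uint64_t mask = 0x00ff00ff00ff00ffULL;

    /* High bytes move down into the low byte of each halfword ... */
    uint64_t hi_to_lo = (x >> 8) & mask;
    /* ... and low bytes move up into the high byte of each halfword. */
    uint64_t lo_to_hi = (x & mask) << 8;

    return hi_to_lo | lo_to_hi;
}

/* Same idea for a 32-bit value (the sf=0 case), again hypothetical. */
static uint32_t rev16_32(uint32_t x)
{
    const uint32_t mask = 0x00ff00ffU;

    return ((x >> 8) & mask) | ((x & mask) << 8);
}
```

With this sketch, rev16_64(0x0123456789abcdefULL) yields 0x23016745ab89efcd.
The number of operations is the same for both widths, which is presumably
why the gain over a per-halfword loop is largest when sf=1.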

Reviewed-by: Richard Henderson <r...@twiddle.net>


r~

