After some investigation into Andrew's report of corruption in his ppc64le tests at https://lists.gnu.org/archive/html/qemu-devel/2019-02/msg07234.html, I discovered the underlying cause was that the first 32 VSX registers are not stored in host endian order.
This is something that Richard and I had discussed before, but missed that with VSX if you have source registers from different register sets then even logical operations will give you the wrong result. Rather than revert 7b8fe477e1 "target/ppc: convert VSX logical operations to vector operations" let's keep the use of the accelerated vector instructions, and instead fix the real problem which is to switch the first 32 VSX registers to host endian order matching the VMX registers. Patches 1-5 aim to consolidate the offset calculations for both CPUPPCState and the associated _ptr() functions into one single place. With this preliminary work complete, patch 6 switches the first 32 registers into host endian order without too much difficulty. Finally now that all VSX registers are stored in the same way, the vsr offset functions and get_cpu_vsrh()/get_cpu_vsrl() can be simplified accordingly. Signed-off-by: Mark Cave-Ayland <mark.cave-ayl...@ilande.co.uk> Mark Cave-Ayland (8): target/ppc: introduce single fpr_offset() function target/ppc: introduce single vsrl_offset() function target/ppc: move Vsr* macros from internal.h to cpu.h target/ppc: introduce avrh_offset() and avrl_offset() functions target/ppc: introduce avr_offset() function target/ppc: switch fpr/vsrl registers so all VSX registers are in host endian order target/ppc: introduce vsrh_offset() function target/ppc: simplify get_cpu_vsrh() and get_cpu_vsrl() functions target/ppc/cpu.h | 56 +++++++++++++++++++++++++++++++++++-- target/ppc/internal.h | 27 +++--------------- target/ppc/machine.c | 8 +++--- target/ppc/translate.c | 28 ++++++++----------- target/ppc/translate/vmx-impl.inc.c | 27 ++++++++---------- target/ppc/translate/vsx-impl.inc.c | 39 +++----------------------- 6 files changed, 88 insertions(+), 97 deletions(-) -- 2.11.0