On 19/5/26 18:22, James Hilliard wrote:
VMULU multiplies the active Octeon multiplier state by rs, adds rt and
queued partial products, returns the low result, and advances P[0]/P[1]
with carry limbs.
Expand the two-limb accumulator operation inline with TCG so the result
and partial-product state stay visible to the optimizer.
Signed-off-by: James Hilliard <[email protected]>
Signed-off-by: Richard Henderson <[email protected]>
---
Changes v2 -> v3:
- Split VMULU out of the combined Octeon arithmetic and memory
instruction patch. (requested by Richard Henderson)
Changes v5 -> v6:
- Rename the translator helper callback typedef for clarity.
Changes v7 -> v8:
- Use Richard Henderson's v7.5 inline TCG translator with
tcg_gen_addN_i64.
---
target/mips/tcg/octeon.decode | 2 ++
target/mips/tcg/octeon_translate.c | 35 +++++++++++++++++++++++++++++++++++
2 files changed, 37 insertions(+)
Tested-by: Philippe Mathieu-Daudé <[email protected]>