On 19/5/26 18:22, James Hilliard wrote:
VMULU multiplies the active Octeon multiplier state by rs, adds rt and
queued partial products, returns the low result, and advances P[0]/P[1]
with carry limbs.

Expand the two-limb accumulator operation inline with TCG so the result
and partial-product state stay visible to the optimizer.

Signed-off-by: James Hilliard <[email protected]>
Signed-off-by: Richard Henderson <[email protected]>

---
Changes v2 -> v3:
   - Split VMULU out of the combined Octeon arithmetic and memory
     instruction patch.  (requested by Richard Henderson)

Changes v5 -> v6:
   - Rename the translator helper callback typedef for clarity.

Changes v7 -> v8:
   - Use Richard Henderson's v7.5 inline TCG translator with
     tcg_gen_addN_i64.
---
  target/mips/tcg/octeon.decode      |  2 ++
  target/mips/tcg/octeon_translate.c | 35 +++++++++++++++++++++++++++++++++++
  2 files changed, 37 insertions(+)

Tested-by: Philippe Mathieu-Daudé <[email protected]>

Reply via email to