Re: [Dwarf-discuss] Clarification: DW_OP_mod doesn't specify which definition of modulo

Ben Woodard via Dwarf-discuss Fri, 07 Nov 2025 11:03:15 -0800


On 11/7/25 9:18 AM, Mark Wielaard wrote:

Hi Ben,


On Fri, 2025-11-07 at 08:09 -0800, Ben Woodard wrote:

On 11/7/25 6:58 AM, Mark Wielaard wrote:

On Thu, 2025-11-06 at 11:34 -0800, Ben Woodard via Dwarf-discuss wrote:

I think renaming is really confusing. And I think extending to
supporting floating point types should be a separate issue that would
also look at the other operators.

Maybe a compromise would be to keep DW_OP_mod (and make DW_OP_rem an
alias?)

I would do it the other way around make DW_OP_mod be a legacy alias and
call the same operation DW_OP_rem.

I think that is fine, as long as they have the same constant value
(0x1d).

Agreed. Same encoding. In the header files there are just two defines which point to the same constant value. Old consumers can continue to print DW_OP_mod (just like DW_OP_push_object_{address,location}), but consumers' human-readable strings should be updated to DW_OP_rem.
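To make the aliasing concrete, here is a minimal sketch of what such a header could look like. DW_OP_rem is only the name proposed in this thread, not part of any published DWARF version, and dw_op_name() is a hypothetical helper, not an API from any existing consumer.

```c
#include <assert.h>
#include <stdint.h>

/* Both names share the single-byte encoding 0x1d discussed above. */
#define DW_OP_mod 0x1d /* legacy alias, kept so existing code still builds */
#define DW_OP_rem 0x1d /* proposed preferred name (assumption) */

/* The alias must never change the encoding. */
static_assert(DW_OP_mod == DW_OP_rem, "alias must share the encoding");

/* Hypothetical helper: the human-readable name an updated consumer prints.
 * Only one case label is possible because both macros expand to 0x1d. */
const char *dw_op_name(uint8_t opcode)
{
    switch (opcode) {
    case DW_OP_rem:
        return "DW_OP_rem";
    default:
        return "DW_OP_<unknown>";
    }
}
```

An updated consumer would print DW_OP_rem for byte 0x1d no matter which macro the producer's source used; an old consumer keeps printing DW_OP_mod for the same byte.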

Honestly, as a concession: while I think it would be less confusing to rename the operator, I'm really fine with keeping DW_OP_mod as a name so long as the domain is expanded to include signed and unsigned integral types and the algorithm and domain of the operator are documented in the standard.

With Jakub's example, I think that we have a compelling reason to expand
the domain of DW_OP_rem (the former DW_OP_mod) to include signed
integral types as well as unsigned integral types. His example seems to
require the semantics of C99's % operator (truncated division).

If we do this, then it will be backward compatible. The only thing that
we would be changing is the domain over which the current DW_OP_mod
operates. We are not changing any of the semantics.
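As an illustration of the truncated-division semantics referred to above, the following sketch shows how a consumer might evaluate the operation for unsigned and signed integral operands. The function names are hypothetical, and the handling of a zero divisor and of INT64_MIN with a divisor of -1 are assumptions of this sketch, not something the thread or the standard specifies.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* C99 and later define % through truncated division, so a nonzero
 * result takes the sign of the dividend. */
static_assert(-7 % 3 == -1, "truncated division");
static_assert(7 % -3 == 1, "truncated division");

/* Unsigned operands: the domain the thread describes as the current one. */
bool rem_unsigned(uint64_t a, uint64_t b, uint64_t *out)
{
    if (b == 0)
        return false; /* assumption: report an evaluation error */
    *out = a % b;
    return true;
}

/* Signed operands: the proposed expansion of the domain. */
bool rem_signed(int64_t a, int64_t b, int64_t *out)
{
    if (b == 0)
        return false; /* assumption: report an evaluation error */
    if (b == -1) {
        *out = 0; /* avoids INT64_MIN % -1, which is undefined in C */
        return true;
    }
    *out = a % b;
    return true;
}
```

For non-negative operands both functions return the same value, which is why expanding the domain in this way leaves existing unsigned results unchanged.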

Because the semantics weren't really defined for the expanded domain,
which means e.g. gdb does interpret it differently.

Agreed, GDB will have to change. However, when we discovered that it implemented DW_OP_mod using language-specific semantics from the UI, everyone, including the GDB developers on the call, agreed that was the wrong thing to do. IIRC there are some 200 languages that are supported by DWARF and not all of them have specified behavior for signed mod. Thus trying to interpret DW_OP_mod in a language-specific way was universally deemed "wrong" or even "insane". Evidently, in the DWARF committee meeting, having the implementation tied to the source language was universally panned.

In the DWARF for GPUs meeting, we decided to put forth a proposal specifying that DWARF operations were their own thing and not tied to the source language. I was assigned the task to draft that proposal but I have yet to do so.

I'm ambivalent about expanding the domain to floating point values. If
someone has a reason for having it work on floating point types, then
sure why not. It is a bit of extra code in every consumer but whatever.

And it makes us have to define how floating point values are
represented and pick a specific interpretation of the operations.

My understanding is that the DWARF committee already decided to allow floating point numbers on the stack. I was not around for this and I do not know the exact reasoning. My guess as to why that was done is so that if a FP number was optimized out, it could still be reconstructed with DWARF expressions and then represented in implicit storage. This would suggest encoding of FP numbers would have to follow the consumer's target architecture and interpretation of operators would need to be specific enough to allow the unambiguous reconstruction of the optimized out variable on that target architecture.

That being the case, certainly many of the arithmetic operators would need to be defined for floating point base types. However, the thing that gives me pause with DW_OP_rem (or its old name DW_OP_mod) is that even in modern C or C++ modulo for floating point numbers is a function call, fmod(), not a primitive operation the way that % is.

What I really care about is that when we update the description of
DW_OP_rem (the operation formerly known as DW_OP_mod) we specify both
the domain of the operator as well as the algorithm used.

I think doing both a renaming and redefining the interpretation of the
operation and domain at the same time is super confusing. Better to
just introduce two new operations for the expanded domain.


I disagree with you on this particular point.

As a counter argument I point to DW_OP_push_object_addr which is now a legacy alias with the same encoding as the current DW_OP_push_object_location. The information extracted from execution context is now a fully specified location rather than just a generic value for the address. We changed both the name and what it does in a backward compatible way. I argue that *expanding* the domain of operators in a backward compatible way is a minor change.

As I went through all the operators making https://github.com/woodard/dwarf-locations/blob/op-formatting/024-revise-operations.md I made a bunch of notes where I think the domain of operations should be specified or in some cases expanded. I have to write all of those up. They include:

Does DW_OP_regval_type really need to be limited to a base type? (vector registers)

Does DW_OP_regval_bits really need to be limited to the number of bits of a generic? (large vector predicate registers)

Why can't the logical operators also be applied to integral vector registers and vector predicate registers?

Why can't we mix vector integral types with integral types when doing arithmetic operations? There are literally opcodes in many ISAs for this.

DW_OP_shl and DW_OP_shr should also work on vector registers. This can be used for lane shifting.

Are DW_OP_shl and DW_OP_shr defined for negative shifts? (clarification)
...

I believe major versions like the DWARF6 we are building toward are the time to clean things like this up.

We are also down to only about 50 available opcodes in the single byte operation encoding space, and so we need to be a bit careful about how many new ones we allocate until we all agree to have a flag day and break compatibility with DWARF2-?.

   Then introduce new DW_OP_modulo (defined using floored
division)

Again I'm personally ambivalent about the need for this. I don't think
that it is going to be used very often and I think if we do define it we
should consider pushing it into the new DW_OP_extended operation
encoding space. This will make its encoding a two byte operation but it
will reserve more of the one byte encodings for more frequently used
operations.

Sure DW_OP_modulo and DW_OP_remainder could be "extended" operations.


Agree.

My ambivalence to a true modulo is because unlike truncated division aka remainder, which is used for address arithmetic within both the signed and unsigned domains, true modulo on FP numbers and even truncated division on FP numbers is a function call in C/C++.

I'm happy to let everyone else discuss and decide if we need an actual modulo in DWARF.
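For anyone weighing whether a separate true modulo is needed, here is a small sketch contrasting the two integer definitions under discussion. The function names are hypothetical, and both functions assume a nonzero divisor and exclude the INT64_MIN with -1 case; a real consumer would have to decide how to report those.

```c
#include <assert.h>
#include <stdint.h>

/* Remainder from truncated division (C99 %): a nonzero result has the
 * sign of the dividend. Precondition: b != 0, not (INT64_MIN, -1). */
int64_t trunc_rem(int64_t a, int64_t b)
{
    return a % b;
}

/* Modulo from floored division: a nonzero result has the sign of the
 * divisor. Same preconditions as trunc_rem(). */
int64_t floor_mod(int64_t a, int64_t b)
{
    int64_t r = a % b;
    /* Shift the result by one divisor when the signs disagree. */
    if (r != 0 && ((r < 0) != (b < 0)))
        r += b;
    return r;
}
```

The two only differ when the operands have opposite signs and the division is inexact: trunc_rem(-7, 3) is -1 while floor_mod(-7, 3) is 2. For the non-negative values typical of address arithmetic they agree.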

and DW_OP_remainder (defined using truncated division)
operators that are only to be used with typed DWARF stack values?

I really do believe that a better approach is to rename and expand the
domain of the current DW_OP_mod rather than adding another new special
purpose operator.

I disagree. I think just leave DW_OP_mod for legacy operation on the
generic type and have two clearly defined new DW_OP_modulo and
DW_OP_remainder for typed DWARF stack values is much clearer.

I am happy to let the overall committee decide this.

We agree on most points and have a minor disagreement on a couple of narrow points. We can sort those out in committee.


-ben


Cheers,

Mark

-- 
Dwarf-discuss mailing list
[email protected]
https://lists.dwarfstd.org/mailman/listinfo/dwarf-discuss
