Re: [Dwarf-discuss] Clarification: DW_OP_mod doesn't specify which definition of modulo

Ben Woodard via Dwarf-discuss Sat, 27 Sep 2025 15:50:24 -0700


On 9/24/25 4:46 PM, Ron Brender wrote:

For starters, the proposed text is a non-starter (forgive the play onwords)
because there is no Chapter 2 Section 4 in Knuth's The Art of Computer
Programming Volume 1. Chapter 2 is entitled Information Structures, inwhichsection 2.4 (is that what you mean by "Section 4"?) is entitledMultilinked
Structures, and has nothing to do with the modulo operation.
The discussion mentions Section 1.2.4, which is actually in Volume 1,
Chapter 1, is entitled Integer Functions and Elementary Number Theory,
and does define and discuss the modulo operation.

I can correct that. When we were discussing the issue, Cary pulled outhis copy Knuth, mine is tucked away in a box that never got unpacked thelast time that I moved. I misunderstood his citation.

Even if the citation were correct, I would object on the grounds thatI believethe DWARF text should provide the definition, not a citation that thereaderneeds to consult. A footnote to an external source might be OK ifthere were
complicated issues of possible supplementary interest.

I was kind of hoping that the dwarf-discuss community could propose whatthey think it should be and then I'll be happy to write that into thetext of the proposal. I really don't have a strong opinion on the matter.

I just think that whatever the algorithm is, it should be in thestandard. Right now we have:1) GDB implmenting, Knuth's algorithm but limited to signed and unsignedintegers.2) IIUC John DelSignore said TotalView implements DW_OP_mod using C's %operator.3) and we have an email from Michael Eager from 2011 saying thatDW_OP_mod should only apply to unsigned integral types.


Let's all just get on the same page. (I don't care what page it is.)


Finally, the Knuth definition is given in terms of real numbers, of which
integers are a special case, using floor and ceiling operations. This
would be appropriate if DWARF DW_OP_mod were intended to apply

to floating-point operands but is rather pedantic overkill for justintegers.

Yeah but in that section we have two broad classes of operatorsarithmetic and logical. All the other arithmetic operators are definedwith a domain that includes non-integral types. Should we exclude thisone arithmetic operator from that more expansive range?


It is not as if the algorithm is unknown or particularly complicated.

/x/ mod /y/ = /x/ − /y/ × floor(/x/ / /y/), if /y/ ≠ 0; /x/ mod 0 = /x/.

But I think the real problem is not the definition of DW_OP_mod per se but
the definition of the generic type. DWARF Section 2.5.2 defines thegenerictype as an "integral type that has the size of an address on thetarget machineand unspecified signedness." We know that some architectures treataddresses
as signed and some as unsigned integers, and DWARF is trying not to care.
Most of the time it mostly doesn't matter. But to be concrete, whatdoes one make of
     DW_OP_lit5
     DW_OP_lit2
     DW_OP_neg
     DW_OP_mod
If the generic type is signed, then the result is -1. However, if thegeneric typeis thought to be unsigned, then "-2" is just a very large positivenumber and
the result is 5.

exactly!

We might think about solving this problem by defining the generic typeto be
     a) signed
     b) unsigned
     c) signedness implementation-defined
I would not advocate either a) or b). Moreover, I would be verycaution in overturning the"non-signedness" of generic type which has been characteristic ofDWARF from the beginning
(even before the name "generic type" was introduced).

I am either too young or have lived a too sheltered life to be aware ofany architectures which actually have signed addressing. I've seenplenty of cases where there are signed offsets added to unsignedaddresses to make unsigned addresses. This address arithmetic avoids thecomplication of mixed signed and unsigned arithmetic which can slip intonaively written C code.

It seems to me that what you are really wanting with the generic type is"address arithmetic" where you can do "unsigned_address + signed_offset"and not have to worry about the C rules that can cause mixed unsignedand signed arithmetic to yield unexpected results. For example I believethat "unsigned_address + signed_offset" is actually defined to be"unsigned_address + (unsigned) signed_offset" causing a a very largeunsigned value to be added to the unsigned address in the case wheresigned_offset happens to be a small negative.

I think that both "signed" and "unsigned" are kind of C concepts and Iwould suggest that an other option which may provide a way out of thisdilemma is:

d) the generic type is a type is defined to be an integral type suitablefor address arithmetic.

and more specifically this means that when implementing a consumer in Cyou must be careful when mixing signed and unsigned values doingsomething like:


if (signed_offset >=0 )
  unsigned_address+=signed_offset;
else
  unsigned_address-=abs(signed_offset);

There may be other cases where the arithmetic of the "generic type"diverges from integers in C or other language in subtle ways. DW_OP_modmay be one of those but I haven't thought about it enough to be sure.


-ben


Defining DW_OP_mod to be defined only for unsigned integers seems
overkill and unnecessary when no generic type operands are involved.

A more permissive approach is to specify that an operand of the generic

type is implicitly treated as unsigned. Then use the Knuth definitionrestricted tointegers. This is close to Ben's second alternative but furtherresolves the ambiguity

of generic signedness.

Ben has raised a definite problem for which further thought is surelywarranted...

Ron

On Wed, Sep 24, 2025 at 2:32 PM Ben Woodard via Dwarf-discuss<[email protected]> wrote:


    Background:

    Evidently, originally DWARF didn't allow arithmetic operations on
    floating point numbers and most uses of the DWARF stack were done
    with
    the assumption that the values being acted upon were addresses and so
    the computation was assumed to be acting upon unsigned numbers.

    At some point, DWARF began to allow the arithmetic operations to
    work on
    floating point numbers and several operations were explicitly
    defined to
    work over non-integral values. This led to the paragraph in the
    current
    DWARF working draft that says in section 2.5.2.4 on page 37 lines
    24-27:

    "Operations other than DW_OP_abs, DW_OP_div, DW_OP_minus,
    DW_OP_mul, DW_OP_neg and DW_OP_plus require integral types of the
    operand (either integral base type or the generic type).
    Operations do
    not cause
    an exception on overflow."

    Unlike all the other arithmetic operations this explicitly limits
    DW_OP_mod to integral base types and the generic type. It lumps
    DW_OP_mod in with the logical operations. Furthermore, there are
    multiple definitions of the modulo operator which vary in how they
    handle signed values.

    According to the dwarf-discuss archives, this issue came up back
    in 2011
    and at that time Michael Eager made a pronouncement that DW_OP_mod
    used
    the modulo algorithm for unsigned arithmetic. However, this
    decision was
    not recorded in the standard. Since that time, consumers have
    implemented different implementations of DW_OP_mod.

    This proposal seeks to clarify and harmonize the consumer
    implementations of the DW_OP_mod operator by defining which
    algorithm to
    use for signed arithmetic as well as define it for floating point
    numbers.

    Proposal:

    Add DW_OP_mod to the list of operators which do not require integral
    base types by changing:

    Operations other than DW_OP_abs, DW_OP_div, DW_OP_minus, DW_OP_mul,
    DW_OP_neg and DW_OP_plus require integral types of the operand
    (either
    integral base type or the generic type).

    To:

    Operations other than DW_OP_abs, DW_OP_div, DW_OP_mod, DW_OP_minus,
    DW_OP_mul, DW_OP_neg and DW_OP_plus require integral types of the
    operand (either integral base type or the generic type).

    Then append the following sentence to the description of the
    DW_OP_mod:

    The algorithm used to implement modulo shall be the one defined in
    The
    Art of Computer Programming Volume 1: Fundamental Algorithms
    Chapter 2
    Section 4. Knuth.

    Alternative proposals:

    1) Explicitly state in the standard that DW_OP_mod is only defined
    for
    unsigned integral arithmetic. This effectively standardizes the
    Michael
    Eager's pronouncement from 2011.

    2) Pick any algorithm for modulo that works for signed as well
    unsigned
    arithmetic and specify that DW_OP_modulo shall follow it. The current
    GDB implementation follows Knuth 1.2.4 for signed and unsigned
    integral
    arithmetic but excludes the algorithm for reals and floating point
    numbers.

--Dwarf-discuss mailing list

    [email protected]
    https://lists.dwarfstd.org/mailman/listinfo/dwarf-discuss

-- 
Dwarf-discuss mailing list
[email protected]
https://lists.dwarfstd.org/mailman/listinfo/dwarf-discuss

Re: [Dwarf-discuss] Clarification: DW_OP_mod doesn't specify which definition of modulo

Reply via email to