LDC: Constant Folding Across Nested Functions?

dsimcha Sat, 18 May 2013 10:40:23 -0700

Background: This came from an attempt to get rid of delegateindirection in parallel foreach loops on LDC. LDC can inlinedelegates that always point to the same code. This means that itcan inline opApply delegates after inlining opApply itself andeffectively constant folding the delegate.


Simplified case without unnecessarily complex context:


// Assume this function does NOT get inlined.
// In my real use case it's doing something
// much more complicated and in fact does not
// get inlined.
void runDelegate(scope void delegate() dg) {
    dg();
}

// Assume this function gets inlined into main().
uint divSum(uint divisor) {
    uint result = 0;

    // If divisor gets const folded and is a power of 2 then
    // the compiler can optimize the division to a shift.
    void doComputation() {
        foreach(i; 0U..1_000_000U) {
            result += i / divisor;
        }
    }

    runDelegate(&doComputation);
}

void main() {
    // divSum gets inlined, to here, but doComputation()
    // can't because it's called through a delegate.
    // Therefore, the 2 is never const folded into
    // doComputation().
    auto ans = divSum(2);
}

The issue I'm dealing with in std.parallelism is conceptually thesame as this, but with much more context that's irrelevant tothis discussion. Would the following be a feasible compileroptimization either in the near future or at least in principle:

When an outer function is inlined, all non-static inner functionsshould be recompiled with the information gained by inlining theouter function. In this case doComputation() would be recompiledwith divisor const-folded to 2 and the division optimized to ashift. This post-inlining compilation would then be passed torunDelegate().

Also, is there any trick I'm not aware of to work around thestandard compilation model and force this behavior now?

LDC: Constant Folding Across Nested Functions?

Reply via email to