200-600x slower Dlang performance with nested foreach loop

methonash via Digitalmars-d-learn Tue, 26 Jan 2021 09:45:29 -0800

Greetings Dlang wizards,

I seek knowledge/understanding of a very frustrating phenomenonI've experienced over the past several days.


The problem space:

1) Read a list of strings from a file
2) De-duplicate all strings into the subset of unique strings

3) Sort the subset of unique strings by descending length andthen by ascending lexicographic identity4) Iterate through the sorted subset of unique strings,identifying smaller sequences with perfect identity to theirlargest possible parent string

I have written a Dlang program that performantly progressesthrough step #3 above. I used a built-in AA (associative array)to uniquely de-duplicate the initial set of strings and then usedmultiSort(). Performance was good up till this point, especiallywith use of the LDC compiler.

Things went sideways at step #4: because multiSort() returns aSortedRange, I used .array to convert the returned SortedRangeinto an array of type string[]. This appeared to work, andneither DMD nor LDC threw any warnings/errors for doing this.

With the formally returned array, I then attempted to construct adouble foreach loop to iterate through the sorted array of uniquestrings and find substring matches.


foreach( i, ref pStr; sortedArr )
{
    foreach( j, ref cStr; sortedArr[ i + 1 .. $ ] )
    {
        if( indexOf( pStr, cStr ) > -1 )
        {
            // ...
        }
    }
}

Before adding the code excerpt above, the Dlang program wastaking ~1 second on an input file containing approx. 64,000strings.

By adding the code above, the program now takes 6 minutes tocomplete. An attempt was made to more efficiently performASCII-only substring searching by converting the sorted string[]into ubyte[][] and then using countUntil() instead of indexOf(),but this had an effect that was completely opposite to what I hadpreviously experienced: the program then took over 20 minutes tocomplete!


Thus, I am entirely baffled.

My first attempt to solve this problem space used a small Perlprogram to perform steps 1 through 3, which would then pipeintermediate output to a small Dlang program handling only step#4 using dynamic arrays (no use of AAs) of ubyte[][] with use ofcountUntil().

The Dlang code for the nested foreach block above is essentiallynear-identical between my two Dlang implementations. Yet, thesecond implementation--where I'm trying to solve the entireproblem space in D--has absolutely failed in terms of performance.


Perl+D, ubyte[][], countUntil() :: under 2 seconds
only D, string[], indexOf() :: ~6 minutes
only D, ubyte[][], countUntil() :: >20 minutes

Please advise. This nightmarish experience is shaking myconfidence in using D.

200-600x slower Dlang performance with nested foreach loop

Reply via email to