Re: char array weirdness

Jack Stouffer via Digitalmars-d-learn Wed, 30 Mar 2016 10:36:06 -0700

On Wednesday, 30 March 2016 at 05:16:04 UTC, H. S. Teoh wrote:

If we didn't have autodecoding, would be a simple matter ofsearching for sentinel substrings. This also indicates thatmost of the work done by autodecoding is unnecessary -- it'swasted work since most of the string data is treated opaquelyanyway.

Just to drive this point home, I made a very simple benchmark.Iterating over code points when you don't need to is 100x slowerthan iterating over code units.


import std.datetime;
import std.stdio;
import std.array;
import std.utf;
import std.uni;

enum testCount = 1_000_000;

enum var = "Lorem ipsum dolor sit amet, consectetur adipiscingelit. Praesent justo ante, vehicula in felis vitae, finibustincidunt dolor. Fusce sagittis.";


void test()
{
    auto a = var.array;
}

void test2()
{
    auto a = var.byCodeUnit.array;
}

void test3()
{
    auto a = var.byGrapheme.array;
}

void main()
{
    import std.conv : to;
    auto r = benchmark!(test, test2, test3)(testCount);
    auto result = to!Duration(r[0] / testCount);
    auto result2 = to!Duration(r[1] / testCount);
    auto result3 = to!Duration(r[2] / testCount);

    writeln("auto-decoding", "\t\t", result);
    writeln("byCodeUnit", "\t\t", result2);
    writeln("byGrapheme", "\t\t", result3);
}


$ ldc2 -O3 -release -boundscheck=off test.d
$ ./test
auto-decoding           1 μs
byCodeUnit              0 hnsecs
byGrapheme              11 μs

Re: char array weirdness

Reply via email to