Given that

    static assert("é".length == 2);

I was surprised that

    static assert("é".byCodeUnit.length == 2);
    static assert("é".byCodePoint.length == 2);

Isn't there a way to iterate over accented characters (in my case encoded as UTF-8) in D, so that "é" counts as a single element? Or is this an inherent problem in Unicode? I need this for a syllable-counting algorithm that has to distinguish accented from non-accented variants of vowels. For example, café (two syllables) compared to babe (one syllable).
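To make the question concrete, here is a minimal sketch of the counts I am after. It assumes that `std.utf.byDchar` decodes UTF-8 into code points and that `std.uni.byGrapheme` groups a base letter with its combining marks. The helper names `countCodePoints` and `countGraphemes` are hypothetical, made up for this example.

```d
import std.range : walkLength;
import std.uni : byGrapheme, normalize, NFC;
import std.utf : byDchar;

// Hypothetical helper: number of Unicode code points in a UTF-8 string.
size_t countCodePoints(string s)
{
    return s.byDchar.walkLength;
}

// Hypothetical helper: number of graphemes (user-perceived characters).
size_t countGraphemes(string s)
{
    return s.byGrapheme.walkLength;
}

void main()
{
    // Precomposed form: 'é' is the single code point U+00E9 (2 UTF-8 code units).
    string composed = "caf\u00E9";
    assert(composed.length == 5);            // code units
    assert(countCodePoints(composed) == 4);
    assert(countGraphemes(composed) == 4);

    // Decomposed form: 'e' followed by U+0301 (combining acute accent).
    string decomposed = "cafe\u0301";
    assert(decomposed.length == 6);          // code units
    assert(countCodePoints(decomposed) == 5);
    assert(countGraphemes(decomposed) == 4);

    // Normalizing to NFC first makes the code-point count match again.
    assert(countCodePoints(decomposed.normalize!NFC) == 4);
}
```

If this is right, then counting by grapheme, or normalizing to NFC and counting by code point, would give one element per accented vowel regardless of how the input was encoded.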
