Accented Characters and Counting Syllables

Nordlöw Sat, 06 Dec 2014 14:41:03 -0800

Given the fact that

    static assert("é".length == 2);


I was surprised that

    static assert("é".byCodeUnit.length == 2);
    static assert("é".byCodePoint.length == 2);

Isn't there a way to iterate over accented characters (in my caseUTF-8) in D? Or is this an inherent problem in Unicode? I needthis in a syllable counting algorithm that needs to distinguishaccented and non-accented variants of vowels. For example café (2syllables) compared to babe (one syllable.

Accented Characters and Counting Syllables

Reply via email to