Re: Major performance problem with std.array.front()

Sarath Kodali Fri, 07 Mar 2014 14:41:57 -0800

On Friday, 7 March 2014 at 20:43:45 UTC, Vladimir Panteleev wrote:

On Friday, 7 March 2014 at 19:57:38 UTC, Andrei Alexandrescu wrote:
Allow me to enumerate the functions of std.algorithm and how they work today and how they'd work with the proposed change. Let s be a variable of some string type.
s.canFind('é') currently works as expected.
No, it doesn't.

import std.algorithm;

void main()
{
    auto s = "cassé";
    assert(s.canFind('é'));
}
That's the whole problem - all this hot steam and it still does not work properly. Because it can't - not without pulling in all of the Unicode algorithms implicitly, and that would be much worse.
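A minimal sketch of why the assertion above can fail, assuming the "é" in the literal is stored in decomposed form (an 'e' followed by U+0301 COMBINING ACUTE ACCENT); the escapes below make that explicit. The variable names are illustrative, and the `normalize!NFC` step is only one possible workaround, not something proposed in the thread.

```d
import std.algorithm : canFind;
import std.uni : normalize, NFC;

void main()
{
    // The precomposed code point U+00E9 (é) is what we search for.
    dchar needle = '\u00E9';

    // Precomposed spelling: é is the single code point U+00E9.
    string composed = "cass\u00E9";
    // Decomposed spelling (assumed): 'e' followed by U+0301.
    string decomposed = "casse\u0301";

    // Decoding to dchar finds the precomposed code point...
    assert(composed.canFind(needle));
    // ...but not the visually identical decomposed sequence.
    assert(!decomposed.canFind(needle));

    // Normalizing to NFC first composes e + U+0301 into U+00E9,
    // at the cost of pulling in the Unicode normalization tables.
    assert(normalize!NFC(decomposed).canFind(needle));
}
```

Both strings render identically, so decoding code units into code points does not by itself make the search match what the user sees.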
I went down std.algorithm in the order listed in its documentation and found pernicious issues with almost every single algorithm.
All of your examples are variations of one and the same case: searching for a non-ASCII dchar or dchar literal.
How often does this pattern occur in real programs? I think the only real metric is to try the change and find out.
Clearly one might argue that their app has no business dealing with diacriticals or Asian characters. But that's the typical provincial view that marred many languages' approach to UTF and internationalization.
So is yours, if you think that making everything magically a dchar is going to solve all problems.
The TDPL example only showcases the problem. Yes, it works with Swedish. Now try it again with Sanskrit.

+1

In Indian languages, a character consists of one or more UNICODE code points. For example, in Sanskrit "ddhrya" http://en.wikipedia.org/wiki/File:JanaSanskritSans_ddhrya.svg consists of 7 UNICODE code points. So to search for this char I have to use string search.
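As a rough illustration of the point above, here is a sketch that assumes "ddhrya" is encoded as the Devanagari sequence da, virama, dha, virama, ra, virama, ya. That spelling is my assumption about the linked glyph, and the names `ddhrya` and `text` are made up for the example.

```d
import std.algorithm : canFind;
import std.range : walkLength;

void main()
{
    // Assumed encoding of the conjunct: DA, VIRAMA, DHA, VIRAMA, RA, VIRAMA, YA.
    string ddhrya = "\u0926\u094D\u0927\u094D\u0930\u094D\u092F";

    // walkLength decodes the string and counts code points.
    assert(ddhrya.walkLength == 7);
    // .length counts UTF-8 code units (3 per code point in this range).
    assert(ddhrya.length == 21);

    // No single dchar can stand for the whole character, so it has to
    // be located with a substring search.
    string text = "x" ~ ddhrya ~ "y";
    assert(text.canFind(ddhrya));
}
```

Since the needle is itself a string, the search works the same way whether the haystack is walked by code unit or by code point.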


- Sarath
