Re: Major performance problem with std.array.front()

Andrea Fontana Mon, 10 Mar 2014 03:56:53 -0700

I'm not sure I understand the point of this (long) thread.
Is the main problem that decode() is called even when it is not needed?


Well, in that case the problem is not limited to strings. I ran
into the same problem when writing other ranges, for example
when reading binary data from a db stream. front represents a single
row, and I decode it every time, even when it is not needed.
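
As an illustration of the row-range case, here is a minimal sketch of a range that postpones decoding until front is actually read and caches the result until the next popFront. Row, decodeRow, RowRange and decodeCount are hypothetical names invented for this example; the real db stream and row format are not described in the message, so a row is assumed to be a byte array whose first byte is an id.

```d
import std.range.primitives : isInputRange;

// Hypothetical decoded row; the real row layout is not given in the thread.
struct Row { int id; }

// Hypothetical decoder: assumes the first byte of a raw row is its id.
Row decodeRow(const(ubyte)[] bytes)
{
    return Row(bytes[0]);
}

struct RowRange
{
    private ubyte[][] raw;   // undecoded rows, as read from the stream
    private Row cached;      // decoded form of raw[0], valid only if `decoded`
    private bool decoded;
    size_t decodeCount;      // counts decodeRow calls, for demonstration only

    @property bool empty() const { return raw.length == 0; }

    // Decode on first access only; later calls reuse the cached row.
    @property Row front()
    {
        if (!decoded)
        {
            cached = decodeRow(raw[0]);
            decoded = true;
            ++decodeCount;
        }
        return cached;
    }

    // Skipping a row never decodes it.
    void popFront()
    {
        raw = raw[1 .. $];
        decoded = false;
    }
}

static assert(isInputRange!RowRange);

// Skips one row, reads the next one twice, and reports how many decodes ran.
size_t decodesAfterSkipAndTwoReads()
{
    ubyte[][] data = [[cast(ubyte) 1], [cast(ubyte) 2], [cast(ubyte) 3]];
    auto rows = RowRange(data);
    rows.popFront();             // first row is skipped without decoding
    assert(rows.front.id == 2);  // decodes the second row
    assert(rows.front.id == 2);  // served from the cache
    return rows.decodeCount;
}

void main()
{
    assert(decodesAfterSkipAndTwoReads() == 1);
}
```

The trade-off is that front is no longer const and the range carries extra state, which may or may not be acceptable depending on how the range is used.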

On Friday, 7 March 2014 at 02:37:11 UTC, Walter Bright wrote:

In "Lots of low hanging fruit in Phobos" the issue came up about the automatic encoding and decoding of char ranges.
Throughout D's history, there are regular and repeated proposals to redesign D's view of char[] to pretend it is not UTF-8, but UTF-32. I.e. so D will automatically generate code to decode and encode on every attempt to index char[].
I have strongly objected to these proposals on the grounds that:

1. It is a MAJOR performance problem to do this.
2. Very, very few manipulations of strings ever actually need decoded values.
3. D is a systems/native programming language, and systems/native programming languages must not hide the underlying representation (I make similar arguments about proposals to make ints issue errors on overflow, etc.).
4. Users should choose when decode/encode happens, not the language.
and I have been successful at heading these off. But one slipped by me. See this in std.array:
  @property dchar front(T)(T[] a) @safe pure if (isNarrowString!(T[]))
  {
    assert(a.length, "Attempting to fetch the front of an empty array of " ~
           T.stringof);
    size_t i = 0;
    return decode(a, i);
  }
What that means is that if I implement an algorithm that accepts, as input, an InputRange of char's, it will ALWAYS try to decode it. This means that even:

    from.copy(to)
will decode 'from', and then re-encode it for 'to'. And it will do it SILENTLY. The user won't notice, and he'll just assume that D performance sux. Even if he does notice, his options to make his code run faster are poor.
If the user wants decoding, it should be explicit, as in:

    from.decode.copy(encode!to)
The USER should decide where and when the decoding goes. 'decode' should be just another algorithm.
(Yes, I know that std.algorithm.copy() has some specializations to take care of this. But these specializations would have to be written for EVERY algorithm, which is thoroughly unreasonable. Furthermore, copy()'s specializations only apply if BOTH source and destination are arrays. If just one is, the decode/encode penalty applies.)
Is there any hope of fixing this?
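
For readers following along, here is a minimal sketch of the difference between implicit and explicit decoding. It assumes a Phobos version that provides std.utf.byCodeUnit and std.utf.byDchar; these are not the from.decode / encode!to spellings used in the message above, which are illustrative rather than existing functions. The helper names codeUnitCount and decodedEquals are hypothetical.

```d
import std.algorithm.comparison : equal;
import std.range.primitives : ElementType, front;
import std.utf : byCodeUnit, byDchar;

// The range view of a string yields dchar: front decodes implicitly.
static assert(is(ElementType!string == dchar));

// Number of elements seen when the string is walked as raw code units.
size_t codeUnitCount(string s)
{
    auto units = s.byCodeUnit;   // explicit opt-out: elements are char
    static assert(is(ElementType!(typeof(units)) : char));
    return units.length;
}

// Decoding requested explicitly, as a separate step in the pipeline.
bool decodedEquals(string s, dstring expected)
{
    return s.byCodeUnit.byDchar.equal(expected);
}

void main()
{
    string s = "a\u00E9";        // U+00E9 occupies two UTF-8 code units

    assert(s.front == 'a');      // implicit decode through front
    assert(codeUnitCount(s) == 3);
    assert(decodedEquals(s, "a\u00E9"d));
}
```

In this sketch the caller decides where decoding happens: an algorithm given s.byCodeUnit sees plain chars, and decoding happens only where byDchar is inserted.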
