On Thursday, 8 March 2018 at 17:35:11 UTC, H. S. Teoh wrote:
Yeah, the only reason autodecoding survived in the beginning
was because Andrei (wrongly) thought that a Unicode code point
was equivalent to a grapheme. If that had been the case, the
cost associated with auto-decoding may have been justifiable.
Unfortunately, that is not the case, which greatly diminishes
most of the advantages that autodecoding was meant to have. So
it ended up being something that incurred a significant
performance hit, yet did not offer the advantages it was
supposed to. To fully live up to Andrei's original vision, it
would have to include grapheme segmentation as well.
Unfortunately, graphemes are of arbitrary length and cannot in
general fit in a single dchar (or any fixed-size type), and
grapheme segmentation is extremely costly to compute, so doing
it by default would kill D's string manipulation performance.
I remember it a bit differently from the last time it was discussed:
- removing auto-decoding would break a lot of code, since it's used
in lots of places
- the performance loss could be mitigated with .byCodeUnit every time
- Andrei correctly advocated against breakage
Personally I do use auto-decoding, often iterating by code point,
and I use it for fonts and parsers. It's correct for a large
subset of languages. You gave us a feature and now we are using
it ;)
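
The code unit / code point / grapheme distinction discussed above can be sketched quickly (in Python here rather than D, purely for brevity; the same counts apply to D's `string`, `dchar`, and `byGrapheme`). The `naive_graphemes` helper is a hypothetical, simplified segmentation, not the full UAX #29 algorithm:

```python
import unicodedata

def naive_graphemes(s):
    # Naive cluster segmentation: start a new cluster at every
    # non-combining code point. Real grapheme segmentation (UAX #29)
    # is far more involved, which is part of why it is costly.
    clusters = []
    for ch in s:
        if clusters and unicodedata.combining(ch):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters

s = "e\u0301"  # 'e' + COMBINING ACUTE ACCENT, rendered as one glyph
print(len(s.encode("utf-8")))   # 3 UTF-8 code units
print(len(s))                   # 2 code points (dchars, in D terms)
print(len(naive_graphemes(s)))  # 1 grapheme
```

The middle line is what auto-decoding iterates over: more than the graphemes a user perceives, fewer than the code units stored in memory, which is why it is correct for many languages yet not for all.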