On Thursday, 2 June 2016 at 21:38:02 UTC, default0 wrote:
> On Thursday, 2 June 2016 at 21:30:51 UTC, tsbockman wrote:
>> 1) It does not say that level 2 should be opt-in; it says that
>> level 2 should be toggleable. Nowhere does it say which of
>> level 1 and 2 should be the default.
>> 2) It says that working with graphemes is slower than UTF-16
>> code UNITS (level 1), but says nothing about streaming
>> decoding of code POINTS (what we have).
>> 3) That document is from 2000, and its claims about
>> performance are surely extremely outdated, anyway. Computers
>> and the Unicode standard have both changed much since then.
> 1) Right, because a special toggleable syntax is definitely not
> "opt-in".
It is not "opt-in" unless it is toggled off by default. The only
reason the document doesn't mention toggling in the level 1
section is that the section is written on the assumption that many
programs will *only* support level 1.
> 2) Several people in this thread noted that working on
> graphemes is way slower (which makes sense, because it's yet
> another processing step you must perform after decoding -
> therefore more work - therefore slower) than working on code
> points. And working on code points is way slower than working
> on code units (the actual level 1).
> 3) Not an argument - doing more work makes code slower.
What do you think I'm arguing for? It's not graphemes-by-default.
What I actually want to see: permanently deprecate the
auto-decoding range primitives, and force the user to explicitly
specify whichever of `by!dchar`, `byCodePoint`, or `byGrapheme`
their specific algorithm actually needs. Removing the implicit
conversions between `char`, `wchar`, and `dchar` would also be
nice, but I don't think it's strictly necessary.
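
To make the distinction concrete, here's a minimal sketch of that
explicit style. I'm assuming current Phobos names here: `byCodeUnit`
and `byDchar` live in `std.utf`, `byGrapheme` in `std.uni`, and
`byDchar` is the code-point-level range I'd expect `by!dchar` to
correspond to:

```d
import std.range : walkLength;
import std.stdio : writeln;
import std.uni : byGrapheme;
import std.utf : byCodeUnit, byDchar;

void main()
{
    // "noël", with the 'ë' spelled as 'e' + U+0308 (combining diaeresis)
    string s = "noe\u0308l";

    // Level 1: raw UTF-8 code units, no decoding at all.
    writeln(s.byCodeUnit.walkLength); // 6
    // Decoded code points - what auto-decoding silently gives you today.
    writeln(s.byDchar.walkLength);    // 5
    // User-perceived characters (graphemes) - the most work of the three.
    writeln(s.byGrapheme.walkLength); // 4
}
```

Each level does strictly more work than the one before it, which is
exactly the cost ordering described above - and with explicit range
adapters, the caller chooses which cost to pay.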
That would be a standards-compliant solution (one of several
possible). What we have now is non-standard, at least going by
the old version Walter linked.
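
For reference, the current auto-decoding behavior is visible right
in the type system. A minimal sketch, assuming the Phobos we have
now:

```d
import std.range.primitives : ElementType;

// A string is stored as UTF-8 code units...
static assert(is(typeof("abc"[0]) == immutable(char)));
// ...but the range primitives silently decode it to code points,
// so every generic algorithm pays the decoding cost by default.
static assert(is(ElementType!string == dchar));
```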