Re: Character sets PDD ready for review

Allison Randal Tue, 01 Apr 2008 17:23:17 -0700

Leopold Toetsch wrote:

1) The Parrot internal character type
«Strings in Parrot's native string format will probably be an array of "Parrot_Rune"s.»
or iso-8859-1 or UCS-2.

To be more accurate: Parrot has *no* native string format. It stores strings in whatever format you give it (including iso-8859-1, UCS-2, ASCII, etc). And it stores them as a string buffer, not an array of any type of character.
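
As a rough illustration of that point (a sketch only, not Parrot's actual code): the string keeps the bytes it was handed plus a note of their encoding, and nothing is converted on storage. The `StoredString` class and its field names are hypothetical, and `utf-16-le` stands in for UCS-2 because Python has no dedicated UCS-2 codec.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StoredString:
    """Hypothetical stand-in for a string header: raw bytes plus an encoding tag."""
    buffer: bytes   # bytes exactly as supplied; never converted on storage
    encoding: str   # name of the encoding the caller declared

    def to_text(self) -> str:
        # Decoding happens only on demand, using the declared encoding.
        return self.buffer.decode(self.encoding)


# The same text stored in two different formats (assumed sample data).
latin1 = StoredString("café".encode("iso-8859-1"), "iso-8859-1")
utf16 = StoredString("café".encode("utf-16-le"), "utf-16-le")
```

The two buffers differ byte for byte, yet both decode to the same text, which is the sense in which there is no single native format.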

2) the concept of Parrot_Rune or

<cite>
Unicode codepoint where values >= 0x80000000 are
       understood to be entries into the global "Parrot_grapheme_table" array.
</cite>

seems to imply that we are going to start:

a) rewriting / improving the ICU library we use now
b) inventing a new "standard"
c) doing a lot of future hiring to keep in sync with the Unicode folks ;-)

Basically, I have some concerns about who will implement and maintain it.

Agreed that would be bad, but I don't think Simon intended that. Regardless, the current spec is only for an additional normalization form added on top of the existing Unicode Standard. No changes to ICU, just another way of interacting with strings, whatever format they happen to be in.
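
To make the quoted definition concrete, here is a minimal sketch of how a value at or above 0x80000000 could refer to a grapheme table entry while ordinary codepoints stand for themselves. Only the 0x80000000 threshold and the idea of a global table come from the quoted text. The function names are hypothetical, and keeping the table index in the low 31 bits is an assumption, not something the spec text above confirms.

```python
GRAPHEME_FLAG = 0x80000000  # threshold taken from the quoted definition

# Hypothetical stand-in for the global "Parrot_grapheme_table".
grapheme_table: list[tuple[int, ...]] = []


def intern_grapheme(codepoints: tuple[int, ...]) -> int:
    """Return a rune for a multi-codepoint grapheme, adding it to the table if new."""
    if codepoints not in grapheme_table:
        grapheme_table.append(codepoints)
    # Assumption: the table index is stored in the low 31 bits.
    return GRAPHEME_FLAG | grapheme_table.index(codepoints)


def expand_rune(rune: int) -> tuple[int, ...]:
    """Map a rune back to the Unicode codepoints it stands for."""
    if rune >= GRAPHEME_FLAG:
        return grapheme_table[rune - GRAPHEME_FLAG]
    return (rune,)  # an ordinary codepoint stands for itself


# "e" followed by COMBINING ACUTE ACCENT, treated as one grapheme.
rune = intern_grapheme((0x65, 0x301))
```

Nothing here touches ICU or the Unicode data itself; the table is only a layer over existing codepoints.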


Allison
