Precomposed Character & Grapheme on wikipedia

spir Tue, 25 Jan 2011 08:51:05 -0800

Hello,

I stumbled upon Wikipedia's article http://en.wikipedia.org/wiki/Precomposed_character which is, imo, excellent. (It does not (yet) address the consequences for programming with Unicode that we debated on this list.) One enigmatic point is "Precomposed characters are the legacy solution for representing many special letters in various character sets." I still fail to see how precomposed characters help solve the issues posed by texts encoded in legacy character sets (since they need to be decoded anyway). Explanations welcome.
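
For readers who want to see the distinction concretely, here is a minimal Python sketch. It is an illustration only, not taken from the Wikipedia article or from the earlier discussion on this list. It assumes Python 3 and its standard `unicodedata` module; the choice of "é" and of Latin-1 as the legacy character set is arbitrary. One possible reading of the quoted sentence, offered as an assumption rather than as the article's intent, is that a precomposed code point lets a letter stored as one byte in a legacy set decode to one code point and encode back to the same byte.

```python
import unicodedata

# Precomposed form: one code point, U+00E9 LATIN SMALL LETTER E WITH ACUTE.
precomposed = "\u00e9"
# Decomposed form: base letter "e" followed by U+0301 COMBINING ACUTE ACCENT.
decomposed = "e\u0301"

# A reader sees the same letter, but the code point sequences differ.
equal_as_code_points = precomposed == decomposed
lengths = (len(precomposed), len(decomposed))

# Normalization converts between the two forms.
nfc = unicodedata.normalize("NFC", decomposed)   # composes to U+00E9
nfd = unicodedata.normalize("NFD", precomposed)  # decomposes to "e" + U+0301

# Latin-1 (chosen here as an example legacy set) stores this letter as byte 0xE9.
legacy_bytes = b"\xe9"
decoded = legacy_bytes.decode("latin-1")   # yields the precomposed code point
round_trip = decoded.encode("latin-1")     # yields the original byte again
```

Under that reading, the precomposed code point matters at the decoding boundary: the decoder can map each legacy byte to exactly one code point, and the reverse mapping needs no composition step.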

This article brought me to http://en.wikipedia.org/wiki/Grapheme. It seems I was partially wrong in stating that using "grapheme" to denote what we commonly think of as a character is an error. Possibly "grapheme" in English and "graphème" in French are not quite synonyms. For instance, "ph" is commonly regarded as a single grapheme in French (<--> phoneme /f/ indeed), so that grapheme and character are not at all synonyms; while according to en-wikipedia's article it may be 2 in English. What do you think?

The point remains that the notion of grapheme only applies to elements of writing systems (letters, syllables...), used to write 'words'. What we need is a term which, just like "character" in the context of computing, for both users and programmers, encompasses thingies like tabulation or newline marks, copyright or paragraph signs, and much more... even the null character ;-). "Grapheme" is usable provided it is clearly defined as meaning precisely that in the context of UCS/Unicode, which neither the Unicode literature nor the literature about Unicode does, AFAIK. Otherwise, it just piles confusion on confusion.


Denis
--
_________________
vita es estrany
spir.wikidot.com
