"Unicode in 'NFG' formation" ?

John M. Dlugosz Sat, 16 May 2009 18:39:24 -0700

I was going over S02, and found it opens with, "By default Perl presentsUnicode in "NFG" formation, where each grapheme counts as one character."

I looked up NFG, and found it to be an invention of this group, butdidn't find any details when I tried to chase down the links.

This opens a whole bunch of questions for me. If you mean that thedefault for what the individual items in a string are is graphemes, OK,but what does that have to do with parsing source code? Even so, that'snot something that would be called a Normalization Form.

Character set encodings and stuff is one of my strengths. I'd like tostraighten this out, and can certainly straighten out the wording, butfirst need to know what you meant by that.


Can someone catch me up on the particulars?

--John

"Unicode in 'NFG' formation" ?

Reply via email to