Re: PDD 4: Internal data types

2001-03-06 Thread Dan Sugalski
At 04:14 PM 3/5/2001 -0800, Hong Zhang wrote:
Here is an example, "re`sume`" takes 6 characters in Latin-1, but could take 8 characters in Unicode. All Perl functions that directly deal with character position and length will be sensitive to encoding. I wonder how we should handle this
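The 6-versus-8 count can be reproduced with a short sketch. It assumes that "re`sume`" stands for "résumé" and that the 8-character figure refers to the decomposed form, where each accented letter becomes a base letter plus a combining acute accent (U+0301). Python's standard unicodedata module is used here only for illustration; the variable names are not from the thread.

```python
import unicodedata

# Assumption: the example word is "résumé", written with precomposed é (U+00E9).
word_nfc = unicodedata.normalize("NFC", "r\u00e9sum\u00e9")

# Decomposed form: each é becomes "e" followed by U+0301 (combining acute accent).
word_nfd = unicodedata.normalize("NFD", word_nfc)

nfc_length = len(word_nfc)                         # code points, composed
nfd_length = len(word_nfd)                         # code points, decomposed
latin1_bytes = len(word_nfc.encode("latin-1"))     # one byte per character

print(nfc_length, nfd_length, latin1_bytes)
```

Any function that reports length or position in code points gives different answers for the two forms, even though both display as the same word.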

Re: PDD 4: Internal data types

2001-03-06 Thread Hong Zhang
Unless I really, *really* misread the unicode standard (which is distinctly possible) normalization has nothing to do with encoding,
I understand what you are trying to say. But it is not very easy in practice. The normalization has something to do with encoding. If you compare two strings
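The snippet is cut off at the comparison, so the following is only a guess at the kind of case under discussion: two strings that render identically but differ in normalization form compare unequal until both are brought to the same form. The helper name same_text is hypothetical, and Python's unicodedata module stands in for whatever Perl would eventually provide.

```python
import unicodedata

def same_text(a: str, b: str, form: str = "NFC") -> bool:
    """Compare two strings after normalizing both to the same form."""
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)

composed = "r\u00e9sum\u00e9"        # precomposed é (U+00E9)
decomposed = "re\u0301sume\u0301"    # "e" + combining acute accent (U+0301)

raw_equal = composed == decomposed                  # code point by code point
normalized_equal = same_text(composed, decomposed)  # after NFC normalization

print(raw_equal, normalized_equal)
```

The sketch works on code points only and says nothing about how the strings are stored as bytes, which is the point the two posters disagree on.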

Re: PDD 4: Internal data types

2001-03-06 Thread Dan Sugalski
At 01:21 PM 3/6/2001 -0800, Hong Zhang wrote:
Unless I really, *really* misread the unicode standard (which is distinctly possible) normalization has nothing to do with encoding,
I understand what you are trying to say. But it is not very easy in practice. The normalization has something to