Re: [fpc-devel] FPC 2.3.1 seems a mixed mess with Unicode support

Jonas Maebe Wed, 16 Sep 2009 02:57:27 -0700


On 16 Sep 2009, at 11:44, Michael Schnell wrote:

Jonas Maebe wrote:


Analysing strings by hand not a very smart thing to do with unicode
strings.


How should it be avoided if I want to react on a user input or on a
string read from a file ?

Don't analyse them character by character, but use standard functionsto compare them. Any unicode support library worth its salt will offeryou many different ways to compare strings, because depending on thecontext you may need different ways:

a) the locale may matter (e.g., depending on whether "." means"decimal point" or "thousands separator", a comparison result may bedifferent)b) you have many different ways to order (unicode) strings. E.g.,these are the options that Apple's CFString comparison offers: <http://developer.apple.com/mac/library/documentation/CoreFoundation/Reference/CFStringRef/Reference/reference.html#//apple_ref/doc/constant_group/String_Comparison_Flags> (note that not all of those flags are about regular comparisons,and some of them are just for performance reasons). See in particularflags such as kCFCompareNonliteral, kCFCompareWidthInsensitive andkCFCompareLocalized.

This indeed causes problems with Pascal's generic comparisonoperators. I guess we will either have to define a particularbehaviour for them (presumably whatever CodeGear chose), add someglobal variable that you can set to influence the behaviour, or tellpeople to use CompareText() and friends (and probably add variantswith various options).

The upside of these complications (which have always existed, but mostpeople just ignored them and their programs only worked with one ortwo locales and/or encodings), is that if you deal with it properly inthe context of unicode, then your code will probably automaticallybehave "correctly" with many locales/scripts.



Jonas
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel

Re: [fpc-devel] FPC 2.3.1 seems a mixed mess with Unicode support

Reply via email to