Michael Schnell schrieb:
On 06/22/2012 01:17 PM, Felipe Monteiro de Carvalho wrote:
Yes, but in practice the only place that I know that uses everything in decomposed unicode is the Mac OS X filesystem. So for dealing with Mac OS X filenames we need special care and for the rest just suppose composed mode (but remembering that some chars have no composed mode!).

Anyway it might be helpful to note this in the wiki page at http://wiki.lazarus.freepascal.org/LCL_Unicode_Support#Searching_a_substring.

The text "For searching in a substring they do work perfectly" might cause misunderstanding on that behalf. (Of course it technically does work fine, but the ambiguity of Unicode itself introduces potential problems).

IMO the meaning is clear: searching for an sub*string* means an *exact* match. Who wants case insensitive or otherwise inexact matches, has to use specialized *text* functions.

Ambiguous character representations are a general problem with characters, where e.g. line endings or whitespace can be represented in different ways. The accent/umlaut composition is not a Unicode problem, it occurs in all codepages.

DoDi


--
_______________________________________________
Lazarus mailing list
Lazarus@lists.lazarus.freepascal.org
http://lists.lazarus.freepascal.org/mailman/listinfo/lazarus

Reply via email to