Re: [fpc-devel] ThousandSeparator

Hans-Peter Diettrich Tue, 25 Nov 2014 13:29:33 -0800

Mattias Gaertner wrote:

> Does concatenating a string and a WideChar create a UnicodeString? Can
> this become a problem?

Concatenation requires two strings, so everything depends on the concrete code. Regardless of any compiler magic, something like this will happen:


var c: WideChar; s, cs: string;
cs := c; // not sure whether the compiler accepts this directly
s := s + cs;

The WideChar can be converted into a Unicode (UTF-8 or UTF-16) string. Afterwards this string may need another conversion when the other string has a different encoding. In the worst case *both* strings are converted to the default Unicode representation (Delphi: UTF-16, Lazarus: UTF-8?) before they are concatenated. Another conversion may occur when the resulting string is assigned to a variable.
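The steps above can be tried out with a small program. This is only a sketch, assuming Free Pascal with {$mode objfpc}{$H+} (so that string is AnsiString) and a plain ASCII comma as the separator; the program name and the values are made up for illustration. With a non-ASCII separator the result depends on the platform's code page, and on Unix a widestring manager (e.g. the cwstring unit) may be needed for the conversion.

```pascal
program WideCharConcat;
{$mode objfpc}{$H+}

var
  c: WideChar;
  s, cs: string;
begin
  c := ',';            // ASCII only, so no data can get lost in the conversion
  s := '1';
  cs := c;             // implicit WideChar -> AnsiString conversion
  s := s + cs + '000'; // plain AnsiString concatenation from here on
  WriteLn(s);          // prints 1,000
end.
```

The compiler may emit a warning about an implicit string type conversion at the `cs := c` line; that is the first of the conversions discussed above.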

All this may become simpler when CP_ACP is used (at least in Delphi) and the separator is given in that encoding, as a single byte/AnsiChar in case of an SBCS CP_ACP. When Lazarus instead uses UTF-8 (MBCS) for CP_ACP, a non-ASCII character occupies more than one byte, so that this simplification is impossible. This suggests storing the delimiter as a string instead of a WideChar, so that concatenating the strings may not require any further conversion.
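A minimal sketch of what a string-typed separator could look like, again assuming Free Pascal with {$mode objfpc}{$H+}. GroupDigits is a hypothetical helper written for this mail, not an RTL function. Because Sep is a string, it can also hold a multi-byte UTF-8 sequence, and the concatenation stays a plain string + string operation.

```pascal
program SepAsString;
{$mode objfpc}{$H+}

// Hypothetical helper: inserts Sep between groups of three digits,
// counted from the right. Digits is expected to contain only digits.
function GroupDigits(const Digits, Sep: string): string;
var
  i, n: Integer;
begin
  Result := '';
  n := Length(Digits);
  for i := 1 to n do
  begin
    Result := Result + Digits[i];
    // a separator follows whenever a multiple of three digits remains
    if (i < n) and ((n - i) mod 3 = 0) then
      Result := Result + Sep;
  end;
end;

var
  Sep: string;
begin
  WriteLn(GroupDigits('1234567', ','));  // prints 1,234,567

  // UTF-8 encoding of U+00A0 (no-break space): two bytes, which would
  // not fit into a single AnsiChar.
  Sep := #$C2#$A0;
  WriteLn(GroupDigits('1234567', Sep));
end.
```

How the second line is displayed depends on the console encoding; the point is only that no WideChar is involved at any step.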

Finally, when the expression (s+cs) is of type RawByteString (depending on the involved function declarations), the result will be stored in the target variable *without* another conversion. The static and dynamic encoding of s may then differ afterwards.
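Whether static and dynamic encoding really diverge can be checked with StringCodePage. The following is a sketch under assumptions: a compiler with code page aware strings (FPC 3.0 or later, or a Unicode Delphi), and a made-up type TLatin1String with static code page 28591 (ISO 8859-1). It does not reproduce the concatenation case exactly; it only routes a UTF-8 string through a RawByteString variable and prints what arrives. The output is deliberately not stated here, because it depends on the compiler version.

```pascal
program RawByteDemo;
{$mode objfpc}{$H+}

type
  // Hypothetical string type with a fixed static code page (ISO 8859-1).
  TLatin1String = type AnsiString(28591);

var
  u: UTF8String;
  r: RawByteString;
  l: TLatin1String;
begin
  u := 'abc';
  r := u;  // RawByteString takes the data together with its dynamic code page
  l := r;  // assignment from a RawByteString source
  WriteLn('dynamic code page of u: ', StringCodePage(u));
  WriteLn('dynamic code page of r: ', StringCodePage(r));
  WriteLn('static code page of l:  ', 28591);
  WriteLn('dynamic code page of l: ', StringCodePage(l));
end.
```

If the last two numbers differ, the variable carries a dynamic encoding other than its declared one, which is the situation described above.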


DoDi

_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel
