Re: [fpc-devel] String and UnicodeString and UTF8Stringt

LacaK Tue, 11 Jan 2011 22:18:19 -0800

...: the new ansistring type has a hidden "element size" field (in addition to the reference count, length and codepage), and from what I can see at page 10 of http://edn.embarcadero.com/article/images/38980/Delphi_and_Unicode.pdf, Delphi 2009's unicodestring is simply an ansistring(1200).

So it seems that if we have a "GenericString" with the properties "reference count", "size", "character width" and "codepage", then all other string types can be based on this string type. The other strings would then be only "shortcuts" and would internally use the same structure:

AnsiString = GenericString(with actual system ANSI code page (0) ... or ... without any explicit codepage ($ffff))
UTF8String = GenericString(with UTF-8 encoding)
UnicodeString = GenericString(with UTF-16 encoding)
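
To make the idea concrete, here is a rough sketch of such a "GenericString" header and the "shortcut" types built on it. It is written in Python purely for illustration and is not FPC or Delphi code. The field names, the CODECS table and the helpers make_string and convert are hypothetical. The values 0, $ffff and 1200 come from the discussion above; 65001 is the Windows code page identifier for UTF-8, and 1250 is just an example ANSI code page.

```python
from dataclasses import dataclass

CP_ACP = 0        # "actual system ANSI code page", as in the message
CP_NONE = 0xFFFF  # "without any explicit codepage", as in the message
CP_UTF8 = 65001   # Windows code page identifier for UTF-8
CP_UTF16 = 1200   # Windows code page identifier for UTF-16 (little endian)

# Hypothetical table: code page -> (Python codec name, bytes per code unit)
CODECS = {
    CP_UTF8: ("utf-8", 1),
    CP_UTF16: ("utf-16-le", 2),
    1250: ("cp1250", 1),  # example ANSI code page
}


@dataclass
class GenericString:
    """Illustrative header plus payload; not the real FPC/Delphi layout."""
    code_page: int
    element_size: int   # "character width" in bytes
    length: int         # number of code units, not bytes
    data: bytes
    ref_count: int = 1


def make_string(text, code_page):
    """Build a GenericString holding text encoded in the given code page."""
    codec, element_size = CODECS[code_page]
    data = text.encode(codec)
    return GenericString(code_page, element_size, len(data) // element_size, data)


def convert(s, code_page):
    """Convert between code pages by decoding and re-encoding the payload."""
    codec, _ = CODECS[s.code_page]
    return make_string(s.data.decode(codec), code_page)


# The "shortcut" types only fix the code page.
def utf8_string(text):
    return make_string(text, CP_UTF8)


def unicode_string(text):
    return make_string(text, CP_UTF16)
```

With this layout, assigning a UTF8String to a UnicodeString would amount to a call like convert(s, CP_UTF16): the header fields say how to interpret the bytes, so the conversion can be done where needed.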

So it seems to me that there is agreement on adding "character width" and "codepage" to the internal "string" record structure and providing conversions where needed, isn't there? (more or less the same approach as in Delphi)

Where there is no agreement is on what the default string encoding should be (AnsiString($ffff), UTF-8, UTF-16 or UTF-32).

So, to return to my original question: is there any agreement on some points related to the "future of String type"?

P.S. I still do not understand how things can work correctly if the LCL expects that all AnsiStrings (String) are UTF8Strings, but the RTL/FCL does not strictly follow this (at least on Windows)?
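
A small illustration of the kind of mismatch I mean, again in Python only as a sketch: the same text is stored once in an ANSI code page and once as UTF-8, and the ANSI bytes are then read by code that assumes UTF-8. The code page cp1250 and the helper names encode_as and read_as_utf8 are assumptions chosen for the example; they are not taken from the RTL, FCL or LCL.

```python
def encode_as(text, codec):
    """Return the bytes a string would hold in the given encoding."""
    return text.encode(codec)


def read_as_utf8(data):
    """Return the decoded text, or None when the bytes are not valid UTF-8."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        return None


text = "\u010daj"  # "caj" with a caron on the c, i.e. one non-ASCII letter

ansi_bytes = encode_as(text, "cp1250")  # what ANSI-based code would hand over
utf8_bytes = encode_as(text, "utf-8")   # what UTF-8-based code expects

print(len(ansi_bytes), len(utf8_bytes))         # the byte lengths differ
print(read_as_utf8(utf8_bytes) == text)         # UTF-8 bytes read back fine
print(read_as_utf8(ansi_bytes))                 # ANSI bytes are not valid UTF-8
print(read_as_utf8(encode_as("tea", "cp1250"))) # pure ASCII text is unaffected
```

As long as the strings contain only ASCII, both sides agree, which may be why the problem is easy to miss; it shows up as soon as a non-ASCII character crosses the boundary without a conversion.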


-Laco.
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
