Re: [fpc-devel] Unicode resource strings

Hans-Peter Diettrich Mon, 20 Aug 2012 11:27:07 -0700

Graeme Geldenhuys schrieb:

On 20/08/12 08:52, Sven Barth wrote:
Just to avoid confusion: The reference counted 2-byte string type on all
platforms is UnicodeString, not WideString (the latter is not reference
counted on Windows platforms).
Please correct me if I am wrong, but I think WideString was referencecounted an all platforms "in the beginning" - like Martin mentioned.Later it was changed, and the new UnicodeString become the "referencecounted on all platforms" type.

WideStrings on Windows platforms are allocated in *system* space, sothat they can be used across processes. Reference counting can occuronly according to the Windows (COM) rules. Delphi UnicodeStrings arestored in the (local) program space instead, so that local referencecounting can be used. Dunno about passing such strings to otherprocesses, though.

The codepage aware string type was added to 2.7.1, because there already
existed a branch for this and "just" needed to be merged. There does not
yet exist any code for Unicode resource strings.
FPC's Unicode support is still in its infancy. It is not just resourcestrings that are missing. As my recent message from the fpc-usersmailing list shows.
Vital decisions of how Unicode should be implemented are still notdecided by the FPC team. There is a major problem in the FPC projectthough. The FPC team seems to be dead-locked on how to implement Unicodefeatures. Nobody can agree on anything. Thus no work can be started onthe RTL and FCL.
In the meant time many projects keep implementing there own Unicodeworkarounds. Not a good sign, but all we can do.

IMO UTF-8 is supported by all platforms, so that there exists no urgentneed for adding UTF-16 support. More problematic is the default "String"type break between older (AnsiString) and newer (UnicodeString) Delphiversions. The consequence of following *that* decision were incompatibleFCL (and LCL) classes, resulting in double maintenance efforts. Thisduplication can be avoided by using the implicit string conversions,offered by the new string types. This applies also to the handling ofresource strings. The runtime impact depends on the string model used ina *program*, where the use of UTF-16 strings would require manyconversions in *interfacing* UTF-8 components/libraries.

It's unclear whether UTF-16 strings really allow for faster stringhandling, since *full* Unicode support still has to take into accountUTF-16 *surrogate pairs*, no real difference vs. handling of UTF-8multibyte sequences.

So the BIG question remains: When will the FPC team sit down and hashout the details of implementing Unicode support? Please note, I'm notsaying "implement it", just saying... "agree on how it should beimplemented". If the FPC team stays in a dead-lock, then maybe thebetter option would be to allow the public to vote on it.


What special support do you expect?
Which of these features are essentially different for UTF-8 and UTF-16?

DoDi

_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel

Re: [fpc-devel] Unicode resource strings

Reply via email to