... pcre takes utf8 strings
... utf-16 is supported by RegexKitLite & lib ICU

... NSString and CFString are implemented as utf-16

On 6 Jun 2008, at 16:02, Jason Stephenson wrote:

Replying to myself here, which I know is generally a bad thing, but this thought just came to me.

I have yet to find a regex library that handles UTF-16 well, if at all. I actually spent a couple of hours yesterday trying to mangle some UTF-16 files in Perl using regular expressions. I gave up and did it in Emacs, the only environment where I've seen REs handle UTF16 properly.

So, that's now my mission, to come up with a RE library that handles UTF16 as gracefully as 7 bit ASCII.

Cheers,
Jason
_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/dreamcat7%40googlemail.com

This email sent to [EMAIL PROTECTED]

_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to [EMAIL PROTECTED]

Reply via email to