Re: Normalize an NSAttributedString

Ken Thomases Wed, 26 Aug 2009 10:23:55 -0700

On Aug 26, 2009, at 10:43 AM, Michael Ash wrote:

On Wed, Aug 26, 2009 at 5:42 AM, Ken Thomases<k...@codeweavers.com>wrote:
On Aug 25, 2009, at 7:21 PM, Ross Carter wrote:
I haven't tried it, but this should work:

       NSAttributedString* original = whatever;
NSMutableAttributedString* normalized = [[originalmutableCopy]
autorelease];
       CFMutableStringRef str = (CFMutableStringRef)[original
mutableString];
       CFStringNormalize(str, kCFStringNormalizationFormD);
This works because -[NSMutableAttributedString mutableString] isa proxy
that automatically fixes up the attribute runs held by its owner.
Hmm, this seems dangerous in the sense that the conversion may belossy. Asfar as I can see, there's no guarantee that CFStringNormalize willperformminimal replacements. If it does not, then whole ranges ofcharacters may
have their attributes reset to that of the first replaced character.
Even if testing reveals it to be non-lossy under one testingenvironment,without a guarantee that might differ under any other testingenvironment.
http://en.wikipedia.org/wiki/Unicode_equivalence

[... quote snipped ...]

I'm well aware of what it means. The question is, which exactoperations on the mutable string proxy does CFStringNormalizeperform. If CFStringNormalize performs the minimal replace operationsto get the result, then it will preserve the attributes closely. It'sconceivable, though, that CFStringNormalize uses a side buffer tocompute the normalized form and then does one big replace of the wholemutable string's range. Or, anywhere in between. Like, it mightreplace a series of precomposed characters with their decompositionsall with one replace operation. In that case, the attributes of mostof the characters will be lost (replaced with the attributes of thefirst character in the replace range).

So, it's clear that the _strings_ will always have a deterministicvalue as a result of normalization. That's the point ofnormalization. But the _attributed strings_ may not.

Also, it should be self-evident that normalizing to a precomposedform willobliterate attribute differences between a base character and anycombining
characters, as discussed elsewhere in this thread.
Good thing he went and normalized to a *de*composed form then, isn'tit?

Martin's example used Form D, but Ross never quite said that's what hewas normalizing to. He might have been adapting Martin's example butusing a different form.


Regards,
Ken

_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to arch...@mail-archive.com

Re: Normalize an NSAttributedString

Reply via email to