At 22:57 5/12/2002, Amir Herman wrote:

>If this is the case, why in Unicode it have Arabic Presentation A & C to
>present the final, medial, and initial form of Arabic characters?

Ah, this is an historical oddity. Character inclusion in Unicode is 
governed by a number of principles, which are not necessarily mutually 
inclusive. So the 'purity' of the character encoding model is sometimes 
sacrificed in the interests of another principle, such as the need to 
provide ono-to-one backwards compatibility mappings with older character 
sets. So Unicode effectively inherits some of the poor design or technical 
limitations of pre-existing character sets. I'm sure Ken Whistler or one of 
the other UTC members can explain exactly why the Arabic presentation forms 
ended up being encoded. I seem to recall that there was some political 
pressure (from Egypt?) that resulted in the unfortunate encoding of an 
arbitrary subset of potential ligature forms, but I don't know the details 
or whether this pressure was also responsible for the inclusion of the 
final, medial and initial forms.

Some existing software, e.g. Adobe PageMaker and InDesign ME (made by 
WinSoft in France), makes use of the presentation form encodings to enable 
a very basic kind of Arabic shaping. Developers of such software to whom I 
have spoken seem to be aware that this is a bit of a hack, and that a more 
sophisticated and elegant Arabic shaping methodology requires only the 
codepoints in the basic Arabic block. Everything else can be handled at the 
glyph processing stage.

John Hudson

Tiro Typeworks          www.tiro.com
Vancouver, BC           [EMAIL PROTECTED]

Last words of Jesuit grammarian Dominique Bouhours:
"I am about to — or I am going to — die; either expression is used."


Reply via email to