Re: Designing a format for research use of the PUA in a RTL mode (from Re: RTL PUA?)

Asmus Freytag Tue, 23 Aug 2011 11:14:10 -0700

On 8/23/2011 7:22 AM, Doug Ewell wrote:

Of all applications, a word processor or DTP application would want to
know more about the properties of characters than just whether they are
RTL.  Line breaking, word breaking, and case mapping come to mind.


I would think the format used by standard UCD files, or the XML
equivalent, would be preferable to making one up:


The right answer would follow the XML format of the UCD.

That's the only format that allows all necessary information to be contained in one file, and it would leverage any effort that users of the main UCD have made in parsing the XML format.

An XML format should also be flexible in that you can add or remove not just characters, but properties as needed.
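To illustrate that flexibility, here is a minimal Python sketch, not a definitive implementation. It assumes the UCD XML conventions as I understand them (a `ucd` root in the namespace shown, `char` elements inside `repertoire`, a `cp` attribute, and property short aliases such as `bc`, `lb` and `WB` as attribute names). The two PUA entries and their property values are invented for the example, and `load_properties` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: two PUA code points with invented property values.
# Element and attribute names are assumed to follow the UCD XML conventions.
SAMPLE = """<ucd xmlns="http://www.unicode.org/ns/2003/ucd/1.0">
  <repertoire>
    <char cp="E000" gc="Lo" bc="R" lb="AL" WB="ALetter"/>
    <char cp="E001" gc="Mn" bc="NSM" lb="CM" WB="Extend"/>
  </repertoire>
</ucd>"""

NS = {"ucd": "http://www.unicode.org/ns/2003/ucd/1.0"}


def load_properties(xml_text):
    """Map each code point to whatever properties its <char> element carries."""
    root = ET.fromstring(xml_text)
    table = {}
    for char in root.findall("ucd:repertoire/ucd:char", NS):
        code_point = int(char.get("cp"), 16)
        # Every attribute other than cp is treated as a property, so adding
        # or removing a property in the file needs no change to the parser.
        table[code_point] = {k: v for k, v in char.attrib.items() if k != "cp"}
    return table


props = load_properties(SAMPLE)
print(props[0xE000]["bc"])
```

Because the parser never names the properties it expects, a file that lists only `bc`, or one that adds further properties, loads the same way.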

The worst thing to do, other than designing something from scratch, would be to replicate the UnicodeData.txt layout with its random but fixed collection of properties and insanely many semicolons. None of the existing UCD txt files carries all the needed data in a single file.
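For comparison, a rough Python sketch of reading one line in the UnicodeData.txt layout. The field labels in `FIELDS` are my own descriptive names for the 15 positional fields, `parse_line` is a hypothetical helper, and the sample line is written in that layout for a private use code point.

```python
# Descriptive labels (assumed, not official) for the 15 positional fields.
FIELDS = [
    "code", "name", "general_category", "combining_class", "bidi_class",
    "decomposition", "decimal", "digit", "numeric", "bidi_mirrored",
    "unicode_1_name", "iso_comment", "uppercase", "lowercase", "titlecase",
]


def parse_line(line):
    """Split one semicolon-delimited record into a dict keyed by field label."""
    values = line.rstrip("\n").split(";")
    if len(values) != len(FIELDS):
        raise ValueError("expected %d fields, got %d" % (len(FIELDS), len(values)))
    return dict(zip(FIELDS, values))


record = parse_line("E000;<Private Use, First>;Co;0;L;;;;;N;;;;;")
print(record["bidi_class"])
print("line_break" in record)
```

The set of fields is fixed by position, and properties such as line breaking or word breaking have no slot in it, so they would have to come from separate files.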

A./
