Re: UAX 29 questions

2015-01-30 Thread Philippe Verdy
2015-01-30 9:32 GMT+01:00 Mark Davis ☕️ : > 2. Also, the following 2 rules are not equivalent: > > a) Any × (Format | Extend) > b) X (Extend | Format)* → X > That's what I replied in the first message but using an "as if" which was not clear enough, my seconde reply reformulated it by making cle

Re: UAX 29 questions

2015-01-30 Thread Mark Davis ☕️
I apology in advance that I'm running low on time, and didn't go through all the messages on this thread carefully. So I may not be fully appreciating people's positions. I'm just making some quick points about 2 items that caught my eye. 1. There are certainly times where two rules in sequence m

Re: UAX 29 questions

2015-01-29 Thread Philippe Verdy
The main reason is that the rest if the text does not test pairs starting by Format or Extend, but Any character that precedes the Format and Extend characters. By saying "ignore"; it just says : whilae parsing from start to ed of text, keep any character in the stqte variable that keeps the WB-pro

Re: UAX 29 questions

2015-01-29 Thread Karl Williamson
On 01/29/2015 08:19 PM, Philippe Verdy wrote: 2015-01-29 19:52 GMT+01:00 Karl Williamson mailto:pub...@khwilliamson.com>>: Rule WB4 is "Ignore Format and Extend characters, except when they appear at the beginning of a region of text.". Not clearly stated, but it appears to me

Re: UAX 29 questions

2015-01-29 Thread Philippe Verdy
2015-01-29 19:52 GMT+01:00 Karl Williamson : > Rule WB4 is > > "Ignore Format and Extend characters, except when they appear at the > beginning of a region of text.". > > Not clearly stated, but it appears to me that the ZWJ must be considered > here to be the beginning of a region of text, as we

Re: UAX 29 questions

2015-01-29 Thread Karl Williamson
On 01/25/2015 05:14 AM, Philippe Verdy wrote: This is not a contradiction. At the very least it is too sloppy for a standard. Once there is a match in the list of rules, later rules shouldn't have to be looked at. I'll submit a formal feedback form. But there is another issue as well. I

Re: UAX 29 questions

2015-01-25 Thread Philippe Verdy
This is not a contradiction. combine the two rules and they are equivalent to these two alternate rules: WB56 can be read as these two: (WB56a) ALetter × (MidLetter | MidNumLet | Single_Quote) (ALetter | Hebrew_Letter) (WB56b) Hebrew_Letter × (MidLetter | MidNumLet | Single_Quote) (ALette

Re: UAX 29 questions

2015-01-24 Thread Richard Wordingham
On Sat, 24 Jan 2015 23:26:09 -0700 Karl Williamson wrote: > But the earlier rule, WB6 > > (ALetter | Hebrew_Letter) × (MidLetter | MidNumLet > | Single_Quote) (ALetter | Hebrew_Letter) > > seems to me to say (among other things) that a Hebrew Letter followed > by a Single Quote should

UAX 29 questions

2015-01-24 Thread Karl Williamson
I vaguely recall asking something like this before, but if so, I didn't save the answers, and a search of the archives didn't turn up anything. Some of the rules in UAX #29 don't make sense to me. For example, rule WB7a Hebrew_Letter × Single_Quote seems to say that a Hebrew_Le