On 6/7/2010 4:26 PM, Masaaki Shibata wrote:
I'm studying the UAX #14 (5.2.0) and testing my code against
LineBreakTest.txt. And I found some test cases on this text file seem
to be contradictory to the rules on the document.

For example, LB25 explicitly prohibits breaking between CP and PO,
while LineBreakTest.txt says "÷ [0.2] RIGHT PARENTHESIS (CP) ÷ [999.0]
PERCENT SIGN (PO) ÷ [0.3]" (l. 1137).

I'm not a Unicode expert; which rules lead to the result like this?
Did I miss any important descriptions on the document?
Probably not. The test file has been known to be wrong before.

The spec clearly states that breaks are only allowed if there are spaces,
as in:

CP SP+ ÷ OP

So this line in the "test" file appears incorrect.

A./




Reply via email to