Re: A simpler definition of the Bidi Algorithm

Asmus Freytag Sun, 17 Oct 2010 10:30:02 -0700

 On 10/17/2010 7:01 AM, Michael D. Adams wrote:

This is something that not even the C++ and Java reference
implementations do (though it appears that the C++ implementation of
the W rules was originally derived from a regular expression as it
uses state tables, but if so it is undocumented).  (Which by the way
they have not been proven to be equivalent, they have merely been
tested.  Proof is a much more complicated formalism.)

Having written the C++ reference implementation, I know a thing or twoabout it :)

The two implementations were not formally proven to be equivalent - in away that wasn't the purpose. The purpose was to see whether several (inthis case two) implementers could read the rules and come up with thesame results.

When someone creates a real-world implementation to a specification likethe Bidi Algorithm, it's not usually verified by formal proof, but bytesting. Therefore, the exercise had to with finding out what level oftesting was sufficient to capture inadvertent misapplication of some ofthe less-well-worded rules. (Some of them have since been rewritten tomake their intent less ambiguous).

Most of the issues were found with the test pass that simply comparedall possible sequences up to length 6. That is better than theBidiTest.txt file, which I understand only goes to length 4. Stochasticsampling of sequences up to length 20 resulted in fewer reporteddiscrepancies - again, all of this is from memory.

For the test, the maximal depth of embeddings was set to 15 instead of63, and the input were strings of bidi classes, not raw characters -therefore cutting down on the number of possible sequences.

The Java implementation was deliberately designed to be transparent -matching the way the rules are formulated in an obvious way. For the C++implementation I wanted to do something different, and possibly faster,so I hand-coded a few state tables. The biggest challenge was not increating those tables, but in understanding the nuances of the rules, bythe way.

The situation today is not fully comparable, since there was somefeedback from the reference implementation project to the rules andespecially their wording.

A./

Re: A simpler definition of the Bidi Algorithm

Reply via email to