7F is a legal XML 1.0 character and XML 1.0 should accept it. And, yes, I
believe that in UTF8 (are you SURE you're reading the file as UTF8 rather
than some other encoding?) it should be a legitimate single byte.

However, the XML 1.0 spec's section 2.2 says "Document authors are
encouraged to avoid "compatibility characters", as defined in section 6.8
of [Unicode] (see also D21 in section 3.6 of [Unicode3])." So using this
character is ill-advised, even though it is legal.

XML 1.1 does generally accept more characters than XML 1.0 does -- but note
that 7F is one of the RestrictedChars, and that XML 1.1 explicitly says
these may not appear within documents or external parsed entities. (Alas,
the Recommendation does not explain why these are restricted.)

Looks to me like you have several choices: Eliminate that character, encode
it somehow (note that a numeric character reference probably wouldn't solve
this problem), or go back to XML 1.0.

______________________________________
"... Three things see no end: A loop with exit code done wrong,
A semaphore untested, And the change that comes along. ..."
  -- "Threes" Rev 1.1 - Duane Elms / Leslie Fish
(http://www.ovff.org/pegasus/songs/threes-rev-11.html)

Reply via email to