http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16693

HTML serializer doesn't take into account the output encoding

------- Additional Comments From [EMAIL PROTECTED]  2003-02-03 15:01 -------
According to section 16.2 of the XSLT Recommendation [1], non-ASCII characters 
in URI attribute values should be escaped using the method recommended in 
Section B.2.1 of the HTML 4.0 Recommendation [2].  The latter recommends that 
non-ASCII characters be represented in UTF-8 before applying the "%HH" 
escaping described by the URI RFC, regardless of the output encoding.
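
To make the rule concrete, here is a minimal Python sketch of the escaping 
as I read it from [2].  The function name escape_uri_attribute and the sample 
URI are hypothetical and are not taken from any serializer's source; the 
sketch leaves ASCII characters untouched, including existing "%" signs.

```python
def escape_uri_attribute(value: str) -> str:
    """Percent-escape non-ASCII characters using their UTF-8 bytes.

    Hypothetical helper: ASCII characters (including any existing '%')
    pass through unchanged; every other character is converted to its
    UTF-8 byte sequence and each byte is written as %HH.
    """
    out = []
    for ch in value:
        if ord(ch) < 128:
            out.append(ch)
        else:
            out.extend(f"%{b:02X}" for b in ch.encode("utf-8"))
    return "".join(out)


# The result does not depend on the encoding chosen for the output document.
print(escape_uri_attribute("http://example.org/café?q=ñ"))
# prints http://example.org/caf%C3%A9?q=%C3%B1
```

Note that the output encoding never appears as a parameter: the same 
attribute value always yields the same escaped string.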

The rationale is simple: if the encoding of the escaped value depended on the 
encoding of the document, then whenever the document was subsequently 
transcoded into another encoding, every escaped character in its URIs would 
have to be decoded, transcoded, and then "%HH" escaped again.
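
The difference can be illustrated with Python's standard urllib.parse 
module.  This is only a sketch under assumed inputs: the href value and the 
choice of ISO-8859-1 as the competing document encoding are made up for the 
example.

```python
from urllib.parse import quote, unquote

href = "résumé.html"  # hypothetical attribute value

# Escaping based on UTF-8, as recommended by HTML 4.0 section B.2.1.
utf8_escaped = quote(href, encoding="utf-8")

# Escaping based on the document encoding (here assumed to be ISO-8859-1).
latin1_escaped = quote(href, encoding="iso-8859-1")

print(utf8_escaped)    # r%C3%A9sum%C3%A9.html
print(latin1_escaped)  # r%E9sum%E9.html

# A consumer that assumes UTF-8 recovers the original only from the first form.
print(unquote(utf8_escaped, encoding="utf-8") == href)    # True
print(unquote(latin1_escaped, encoding="utf-8") == href)  # False
```

Because the UTF-8 form is plain ASCII and carries no dependency on the 
document encoding, it can be copied unchanged when the document is 
transcoded.  The second form is only correct for a reader who knows the 
document was originally ISO-8859-1.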

Are you running into a problem using URIs that are escaped in this way?

[1] http://www.w3.org/TR/xslt#section-HTML-Output-Method
[2] http://www.w3.org/TR/REC-html40/appendix/notes.html#h-B.2.1
