Re: [dix] character encoding in draft-merrells-dix-00.txt

Dick Hardt Sat, 25 Feb 2006 10:41:38 -0800


On 25-Feb-06, at 10:05 AM, robert yates wrote:

I am no expert on character encodings, but are there a few issues with the current draft and character encoding? Namely:
1) The fields dix:/message-type, dix:/membersite-url, dix:/signature and possibly others require Latin characters in their contents. However, isn't a browser free to post the form with a character encoding that does not permit these characters?
2) The Canonicalization Algorithm relies on the browser posting the form in the same character encoding used by the server to generate it. With the current draft, can this be guaranteed?
This article http://www-306.ibm.com/software/globalization/misc/code_considerations/index.jsp provides a good overview of the problem, although it may be a bit out of date. Having read this, what do folks think of the following proposal to address the above issues?
The dix spec ensures that information is always exchanged (via the browser) encoded as UTF-8. It does this by stating that the character encoding of the form pages must be UTF-8, and it also mandates the following rules for the HTML forms used for the data transportation:
1. The HTML head section of the form MUST contain <META http-equiv="Content-Type" content="text/html; charset=utf-8">. There needs to be a corresponding statement for XHTML.
2. The FORM element MUST contain the Accept-Charset attribute and it MUST be set to 'utf-8', i.e. <FORM Accept-Charset="utf-8" Type= ...>

Great suggestion!
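
For illustration only, here is a rough Python sketch of the two points in the quoted proposal. It is not taken from the draft. The helper names (render_form, digest), the field name "full-name", the membersite.example URL, and the use of SHA-1 as the digest are all assumptions made for the example. It shows that the same field value yields different bytes (and so a different digest) under UTF-8 and ISO-8859-1, and what a form page following the two proposed rules might look like.

```python
import hashlib
from html import escape


def render_form(action_url, fields):
    """Return an HTML page whose form follows the two proposed rules.

    Hypothetical helper: the markup is a sketch, not the draft's wording.
    """
    inputs = "\n".join(
        f'<input type="hidden" name="{escape(name)}" value="{escape(value)}">'
        for name, value in fields.items()
    )
    return (
        "<html><head>\n"
        # Rule 1: declare the page encoding as UTF-8.
        '<meta http-equiv="Content-Type" content="text/html; charset=utf-8">\n'
        "</head><body>\n"
        # Rule 2: ask the browser to submit the form as UTF-8.
        f'<form accept-charset="utf-8" method="post" action="{escape(action_url)}">\n'
        f"{inputs}\n"
        "</form></body></html>"
    )


def digest(value, encoding):
    """Hash the bytes of a field value (SHA-1 is an assumption here)."""
    return hashlib.sha1(value.encode(encoding)).hexdigest()


# A value containing a non-ASCII character (hypothetical field content).
value = "José"

# The same text produces different bytes under different encodings,
# so anything computed over the posted bytes will not match.
utf8_digest = digest(value, "utf-8")
latin1_digest = digest(value, "iso-8859-1")
print(utf8_digest == latin1_digest)  # False

# The page must itself be served as UTF-8 bytes for the META tag to be true.
page = render_form("https://membersite.example/dix", {"full-name": value})
page_bytes = page.encode("utf-8")
```

If the browser is left free to pick the encoding, the server cannot know which of the two byte sequences it will receive, which is the concern raised in point 2 of the quoted message.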

