One thing may help you to think about this kind of issue is my 'under
constrution paper - Frank Tang's List of Common Bugs that Break Text
Integrity http://people.netscape.com/ftang/paper/textintegrity.html
I am going to present a newer revsion in the coming IUC25 if they accept
my proposal.
pepe pepe schreef:
We have the following sequence of characters ...izaciĆ³n Map..
that is
the same than ...izaci#243;n Map... that after suffering some
transformations becomes to ...izaci#56186;56333;ap
AS you can see the two characters 56186 and 56333 seem to
represent this
sequences
Pim Blokland wrote:
Not only that, but the process making the mistake of thinking it is
UTF-8 also makes the mistake of not generating an error for
encountering malformed byte sequences,
BTW, this process has a name: Internet Explorer.
AND of outputting the result as two 16-bit numbers
the client for saying that the doc is utf-8 or the parser.
Regards,
Mario.
From: Pim Blokland [EMAIL PROTECTED]
To: Unicode mailing list [EMAIL PROTECTED]
Subject: Re: Problems encoding the spanish o
Date: Mon, 17 Nov 2003 13:26:19 +0100
pepe pepe schreef:
We have the following sequence
From: Marco Cimarosti [EMAIL PROTECTED]
To: 'Pim Blokland' [EMAIL PROTECTED]; Unicode mailing list
[EMAIL PROTECTED]
Pim Blokland wrote:
Not only that, but the process making the mistake of thinking it is
UTF-8 also makes the mistake of not generating an error for
encountering malformed
Philippe Verdy verdy underscore p at wanadoo dot fr wrote:
If IE really wants to keep some compatibility, it may only accept the
CESU-8 encoding only as a possible choice for its automatic
selection of charsets, or display a visible replacement character
(such as a narrow white box) for
6 matches
Mail list logo