Re: Problems encoding the spanish o

2003-11-19 Thread Frank Yung-Fong Tang
One thing may help you to think about this kind of issue is my 'under constrution paper - Frank Tang's List of Common Bugs that Break Text Integrity http://people.netscape.com/ftang/paper/textintegrity.html I am going to present a newer revsion in the coming IUC25 if they accept my proposal.

Re: Problems encoding the spanish o

2003-11-17 Thread Pim Blokland
pepe pepe schreef: We have the following sequence of characters ...izaciĆ³n Map.. that is the same than ...izaci#243;n Map... that after suffering some transformations becomes to ...izaci#56186;56333;ap AS you can see the two characters 56186 and 56333 seem to represent this sequences

RE: Problems encoding the spanish o

2003-11-17 Thread Marco Cimarosti
Pim Blokland wrote: Not only that, but the process making the mistake of thinking it is UTF-8 also makes the mistake of not generating an error for encountering malformed byte sequences, BTW, this process has a name: Internet Explorer. AND of outputting the result as two 16-bit numbers

Re: Problems encoding the spanish o

2003-11-17 Thread pepe pepe
the client for saying that the doc is utf-8 or the parser. Regards, Mario. From: Pim Blokland [EMAIL PROTECTED] To: Unicode mailing list [EMAIL PROTECTED] Subject: Re: Problems encoding the spanish o Date: Mon, 17 Nov 2003 13:26:19 +0100 pepe pepe schreef: We have the following sequence

Re: Problems encoding the spanish o

2003-11-17 Thread Philippe Verdy
From: Marco Cimarosti [EMAIL PROTECTED] To: 'Pim Blokland' [EMAIL PROTECTED]; Unicode mailing list [EMAIL PROTECTED] Pim Blokland wrote: Not only that, but the process making the mistake of thinking it is UTF-8 also makes the mistake of not generating an error for encountering malformed

Re: Problems encoding the spanish o

2003-11-17 Thread Doug Ewell
Philippe Verdy verdy underscore p at wanadoo dot fr wrote: If IE really wants to keep some compatibility, it may only accept the CESU-8 encoding only as a possible choice for its automatic selection of charsets, or display a visible replacement character (such as a narrow white box) for