On Tue, 12 Oct 2004 20:25:16 +0200, Philippe Verdy <[EMAIL PROTECTED]> wrote: > From: "Doug Ewell" <[EMAIL PROTECTED]> > > Theodore H. Smith <delete at elfdata dot com> wrote: > > > >>> - the file mixes UTF-8 and UTF-16 > >> > >> Does this file mix UTF-8 and UTF-16? I thought it just had surrogates > >> encoded into UTF-8? Of course a surrogate should never exist in UTF-8. > > > > You are right. Philippe's statement was incorrect, and also puzzling. > > Have you read the file content? It clearly and explicitly speaks about > UTF-16,
Only in the context of surrogates, and then only in the context of illegal UTF-8 sequences. The file doesn't contain any UTF-16 encoded text. > unless the file was used as a test for CESU-8 The whole point of the CESU-8-like section is that it is not legal UTF-8. -- Clark S. Cox III [EMAIL PROTECTED] http://www.livejournal.com/users/clarkcox3/ http://homepage.mac.com/clarkcox3/

