Myrna van Lunteren <[EMAIL PROTECTED]> writes: > On 7/28/06, Knut Anders Hatlen <[EMAIL PROTECTED]> wrote: >> All of the master files are 7 bit ASCII files, but they are read as >> UTF-8 (which works since ASCII characters have the the same encoding >> in UTF-8 as in US-ASCII). Some of the tests in i18n/* output non-ASCII >> characters, but Sed.java replaces them with >EnC charcode<, so they >> are also 7 bit ASCII. >> >> -- >> Knut Anders >> > Thx for correcting my earlier pronouncements...
Elaborating, not correcting. :) I just wanted to emphasize that even though they sometimes are treated as UTF-8, ISO-8859-1 or Cp1252, they in fact don't contain any characters outside US-ASCII, and therefore it doesn't matter which of these encodings we use. -- Knut Anders
