On Thu, Jul 17, 2008 at 5:20 AM, Allison Randal <[EMAIL PROTECTED]> wrote:
> The thing is, there's a tendency for data for a particular program or > application to all be from the same character set (if, for example, you're > parsing a series of files, munging the data in some way, and writing out a > series of files as a result). We never want to force all data to be > transformed into one "canonical" character set, because it significantly But that it's not my proposal. The proposal is to consider that all texts are already unicode, just encoded in his particular way. And there is no need to transform it unless asked, the same way that utf8 does not need to be converted to utf16 or utf32 if not asked. But better I'll leave this discussion, and not reopen it without first preparing a detailed proposal. -- Salu2