On Mon, Sep 11, 2000 at 08:25:37PM -0700, Ed Batutis wrote:
> >Please read Encode.pm. Mainly I'm interested hearing comments whether
> >this is a good interface...
>
> I like the interface. No complicated options.
>
> It might be more useful if the default for the non-utf-8 characters
> were the system-defined default character encoding of the process
> -rather than iso latin 1. I suspect there are various difficulties
> with this, but it would be more useful for international users.
I can understand the request but the problem is that for this to work
the legacy eight-bit mappings must first be implemented. (E.g. if
the native character encoding is DOS CPXXXX, or Latin 2, or KOI8-R,
before we can hope to map it to utf8, we must map it the full UCS-2
Unicode anyway...)
> Perhaps it would be difficult to use ICU as a utility library for
> Perl, but it seems on the surface to make sense. It is open source
> (some flavor thereof anyway). There are a lot of converters
> implemented and they are very fast. There is a mechanism to allow
> loading only the converter tables that are required. (There's also
> lots of locale support stuff there too, which might also be useful.)
Yes, this is noted down in the file Todo-5.6. The biggest problem is that
the ICU will not be everywhere.
--
$jhi++; # http://www.iki.fi/jhi/
# There is this special biologist word we use for 'stable'.
# It is 'dead'. -- Jack Cohen