On Thu, 14 Jun 2001, Edward Peschko wrote:

> 
> Ok, I'm a bit confused..
> 
> How exactly do you add a new charset map to Unicode::Map? Where do you get the 
> encodings from? Where are they defined?
> 
> I saw your reference to ftp://ftp.unicode.org/MAPPINGS, but that just points
> to a file, not a directory of mapping sets.
> 
> All I'm trying to do is convert from UTF8 to iso-2022-jp ( the form of shift
> jis that is used in email...) any help on how to do this would be greatly 
> appreciated...

Install 'Unicode::MapUTF8' - it probably does what you want:

my $sjis_string = from_utf8({ -string => $utf8_string, 
                             -charset => 'iso-2022-jp' })

Alternatively, install the 'Jcode' module (Unicode::MapUTF8 forms a
'wrapper' around that and other Unicode modules to provide a single
consistent interface for _all_ Unicode charset convertors).

> (ps - the charset that I'm talking about can be found at:
> 
> http://java.sun.com/j2se/1.3/docs/guide/intl/encoding.doc.html
> 
> It would be really, really cool if perl had the same charset codes, or at least
> an alias to them. That way, one wouldn't have to go through this 'is the charset
> there' junk. Unfortunately there seems to be 10 aliases for charsets all over
> the place.

Yah. That problem is being addressed in the I18N::Charset module. I
intend eventually to make Unicode::MapUTF8 aware of that so it can exploit
the known aliases information.
 
-- 
Benjamin Franz

"Premature optimization is the root of all evil in programming."
                                         ---C.A.R. Hoare


Reply via email to