Re: Japanese chars and ARGV

2006-12-02 Thread Chris Wagner
Ok I think I've got this figured out. utf8::decode() does what u want. The bytes in @bytes represent the constituent octets in "tesuto" in Kana. Using utf8::decode() successfully turned the 9 bytes into 3 characters. Let me know if this gets what u need. @bytes = split /\|/, "e3|83|86|e3|82|b9|e
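
The preview above is cut off, so the following is only a minimal sketch of the idea, not the code from the message. It assumes the nine octets are the UTF-8 encoding of the katakana string U+30C6 U+30B9 U+30C8 ("tesuto"); the variable names @hex, $text, $before and $after are made up for illustration.

```perl
use strict;
use warnings;

# Assumed input: the UTF-8 octets of U+30C6 U+30B9 U+30C8 ("tesuto").
# This list is written out for the example, not copied from the message.
my @hex  = qw(e3 83 86 e3 82 b9 e3 83 88);
my $text = join '', map { chr(hex($_)) } @hex;   # a 9-octet byte string

my $before = length $text;                        # length() counts octets here

# utf8::decode() converts the string in place and returns false
# if the octets are not well-formed UTF-8.
utf8::decode($text) or die "input is not valid UTF-8\n";

my $after = length $text;                         # length() now counts characters

print "before: $before, after: $after\n";         # prints "before: 9, after: 3"
```

Note that utf8::decode() is available without loading the utf8 pragma, and that it only helps when the incoming bytes really are UTF-8.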

Re: Japanese chars and ARGV

2006-12-02 Thread Eric Amick
On Sat, 02 Dec 2006 12:00:12 -0800, you wrote:
>On a Japanese version of Windows when you execute a Perl to run a script, the length() fcn returns the wrong number of characters for anything you pass in as @ARGV[0], and the split() fcn seems to work the same way.
>
>Using some of the samples
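
The behaviour described in the quoted question can be sketched as follows. This is an illustration under stated assumptions, not something taken from the thread: it assumes the command-line arguments arrive as bytes in the Japanese Windows ANSI code page (CP932), and it simulates $ARGV[0] with a hypothetical $raw variable so the example runs on any system.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# Hypothetical stand-in for $ARGV[0]: the bytes a CP932 console would
# pass for the katakana string U+30C6 U+30B9 U+30C8 (an assumption).
my $raw = encode('cp932', "\x{30C6}\x{30B9}\x{30C8}");

# Without decoding, length() and split() operate on octets.
my $octets = length $raw;

# After decoding, the same functions operate on characters.
my $arg   = decode('cp932', $raw);
my $chars = length $arg;
my @parts = split //, $arg;

printf "octets: %d, characters: %d, split pieces: %d\n",
    $octets, $chars, scalar @parts;
# prints "octets: 6, characters: 3, split pieces: 3"
```

If the console uses a different code page, the encoding name passed to decode() would have to change accordingly.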

Re: Japanese chars and ARGV

2006-12-02 Thread Chris Wagner
Eh, I don't think this is right. I'm mixing up the code point numbers with the numeric value of the constituent bytes. I'll keep looking though.
At 07:58 PM 12/1/2006 -0500, Chris Wagner wrote:
>The S format turns ur 6 bytes into 3 integers(machine native). U can then feed those 3 integers to
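
The mix-up described in this message can be seen in a short sketch. It assumes, as an illustration only, that the 6 bytes are the CP932 encoding of U+30C6 U+30B9 U+30C8; the names $raw, @words and @code_points are hypothetical. unpack with the S format yields three 16-bit integers built from the raw byte values, and those integers are not the Unicode code points of the characters.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Assumed input: 6 CP932 bytes for three katakana characters.
my $raw = encode('cp932', "\x{30C6}\x{30B9}\x{30C8}");

# 'S' reads unsigned 16-bit integers in machine-native byte order,
# so the exact values depend on the platform's endianness.
my @words = unpack 'S*', $raw;

# The code points the characters actually have.
my @code_points = (0x30C6, 0x30B9, 0x30C8);

for my $i (0 .. $#words) {
    printf "unpacked: 0x%04X   code point: U+%04X\n",
        $words[$i], $code_points[$i];
}
```

Getting from the bytes to code points needs a real decoding step (for example Encode::decode), not a reinterpretation of byte pairs as integers.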