Re: Make Encode.pm support the real UTF-8

2004-12-03 Thread Dan Kogai
On Dec 04, 2004, at 11:51, Larry Wall wrote: On Fri, Dec 03, 2004 at 10:12:12PM +, Tim Bunce wrote: : I've no problem with 'utf8' being perl's unrestricted uft8 encoding, : but "UTF-8" is the name of the standard and should give the : corresponding behaviour. For what it's worth, that's how I'v

Re: Make Encode.pm support the real UTF-8

2004-12-03 Thread Larry Wall
On Fri, Dec 03, 2004 at 10:12:12PM +, Tim Bunce wrote: : I've no problem with 'utf8' being perl's unrestricted uft8 encoding, : but "UTF-8" is the name of the standard and should give the : corresponding behaviour. For what it's worth, that's how I've always kept them straight in my head. Als

Re: About HTML unicode

2004-12-03 Thread Ben Morrow
Quoth [EMAIL PROTECTED] (John Delacour): > At 12:31 am +0800 3/12/04, He Zhiqiang wrote: > > >Now i encountered another problem, there are a few files contains > >not only one charset but also two or more, for example, file1 > >contains japanese and chinese, if i use open() to load the data

Re: About HTML unicode

2004-12-03 Thread Ben Morrow
Quoth [EMAIL PROTECTED] (He Zhiqiang): > > I've a problem to convert a unicode character into it's decimal or > Hexadecimal value. The following URL: > http://code.cside.com/3rdpage/us/unicode/converter.html > can easily convert via Javascript function escape(), but i wonder that is > there s

Re: Make Encode.pm support the real UTF-8

2004-12-03 Thread Tim Bunce
On Sat, Dec 04, 2004 at 04:06:46AM +0900, Dan Kogai wrote: > On Dec 02, 2004, at 23:25, Tim Bunce wrote: > >On Wed, Dec 01, 2004 at 01:28:05PM -0800, Gisle Aas wrote: > >>As you probably know perl's version of UTF-8 is not the real thing. I > >>thought I would hack up a patch to support the encodi

[Encode] 2.09 released!

2004-12-03 Thread Dan Kogai
Porters, I have just released Encode-2.09, AKA GAAS special. Gisle, thank you for all the reports and patches. Wish they were done before 5.8.6 :) =head1 Availability http://www.dan.co.jp/~dankogai/cpan/Encode-2.09.tar.gz or CPAN near you =head1 Changes $Revision: 2.9 $ $Date: 2004/12/03 19:16

Re: Make Encode.pm support the real UTF-8

2004-12-03 Thread Dan Kogai
On Dec 02, 2004, at 23:25, Tim Bunce wrote: On Wed, Dec 01, 2004 at 01:28:05PM -0800, Gisle Aas wrote: As you probably know perl's version of UTF-8 is not the real thing. I thought I would hack up a patch to support the encoding as defined by Unicode. That involves rejecting illegal chars (like s

Re: About HTML unicode

2004-12-03 Thread John Delacour
At 12:31 am +0800 3/12/04, He Zhiqiang wrote: Now i encountered another problem, there are a few files contains not only one charset but also two or more, for example, file1 contains japanese and chinese, if i use open() to load the data into memory, ord and length etc.. can't correctly work!