On Dec 04, 2004, at 11:51, Larry Wall wrote:
On Fri, Dec 03, 2004 at 10:12:12PM +, Tim Bunce wrote:
: I've no problem with 'utf8' being perl's unrestricted utf8 encoding,
: but "UTF-8" is the name of the standard and should give the
: corresponding behaviour.
For what it's worth, that's how I'v
On Fri, Dec 03, 2004 at 10:12:12PM +, Tim Bunce wrote:
: I've no problem with 'utf8' being perl's unrestricted utf8 encoding,
: but "UTF-8" is the name of the standard and should give the
: corresponding behaviour.
For what it's worth, that's how I've always kept them straight in my head.
Als
Quoth [EMAIL PROTECTED] (John Delacour):
> At 12:31 am +0800 3/12/04, He Zhiqiang wrote:
>
> >Now I have encountered another problem: a few files contain
> >not just one charset but two or more. For example, file1
> >contains Japanese and Chinese; if I use open() to load the data
Quoth [EMAIL PROTECTED] (He Zhiqiang):
>
> I have a problem converting a Unicode character into its decimal or
> hexadecimal value. The following URL:
> http://code.cside.com/3rdpage/us/unicode/converter.html
> can easily convert via the JavaScript function escape(), but I wonder: is
> there s
On Sat, Dec 04, 2004 at 04:06:46AM +0900, Dan Kogai wrote:
> On Dec 02, 2004, at 23:25, Tim Bunce wrote:
> >On Wed, Dec 01, 2004 at 01:28:05PM -0800, Gisle Aas wrote:
> >>As you probably know perl's version of UTF-8 is not the real thing. I
> >>thought I would hack up a patch to support the encodi
Porters,
I have just released Encode-2.09, AKA GAAS special. Gisle, thank you
for all the reports and patches. Wish they were done before 5.8.6 :)
=head1 Availability
http://www.dan.co.jp/~dankogai/cpan/Encode-2.09.tar.gz
or CPAN near you
=head1 Changes
$Revision: 2.9 $ $Date: 2004/12/03 19:16
On Dec 02, 2004, at 23:25, Tim Bunce wrote:
On Wed, Dec 01, 2004 at 01:28:05PM -0800, Gisle Aas wrote:
As you probably know perl's version of UTF-8 is not the real thing. I
thought I would hack up a patch to support the encoding as defined by
Unicode. That involves rejecting illegal chars (like s
At 12:31 am +0800 3/12/04, He Zhiqiang wrote:
Now I have encountered another problem: a few files contain
not just one charset but two or more. For example, file1
contains Japanese and Chinese; if I use open() to load the data
into memory, ord, length, etc. don't work correctly!
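
For illustration, here is a minimal sketch of why ord and length can look wrong on data read as raw bytes, and how they behave once the bytes are decoded. It is not taken from the thread: the sample bytes, the variable names, and the assumption that this piece of the data is UTF-8 are all hypothetical. A file that really mixes charsets would need each piece decoded with its own known encoding.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical sample: the raw octets of U+65E5 U+672C encoded as UTF-8,
# as they would arrive from a filehandle opened without an encoding layer.
my $bytes = "\xE6\x97\xA5\xE6\x9C\xAC";

# On undecoded data, length() counts octets and ord() sees the first octet.
my $byte_len   = length $bytes;
my $first_byte = ord $bytes;

# Assumption: this piece is known to be UTF-8. After decoding,
# the same builtins operate on characters instead of octets.
my $chars      = decode('UTF-8', $bytes);
my $char_len   = length $chars;
my $first_char = ord $chars;

printf "bytes: length=%d first=0x%02X\n", $byte_len, $first_byte;
printf "chars: length=%d first=U+%04X (decimal %d)\n",
    $char_len, $first_char, $first_char;
```

The second printf also shows one way to get a character's decimal or hexadecimal code point: ord on the decoded string, formatted with %d or %X.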