Re: ICU and Parrot

2002-06-01 Thread Dan Kogai
On Saturday, June 1, 2002, at 04:37 AM, Autrijus Tang wrote: > Understood. > > In a related note: > > http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html > > has spurred quite a bit discussion in Taiwan because of the mandated > standardization of Big5 => TCA-BIG5, and Big5-HKSCS => HKSCS-

Re: ICU and Parrot

2002-05-31 Thread Dan Kogai
On Saturday, June 1, 2002, at 12:34 AM, Autrijus Tang wrote: > On Fri, May 31, 2002 at 06:18:55AM +0900, Dan Kogai wrote: >> As a matter of fact GB18030 is ALREADY supported via Encode::HanExtra >> by >> Autrijus Tang. The only reason GB18030 was not included in Encode mai

Re: ICU and Parrot

2002-05-30 Thread Dan Kogai
On Friday, May 31, 2002, at 06:06 AM, George Rhoten wrote: > Hopefully you take the implicit information in the UCM files and put > that > into encode implementation too. For instance, in gb18030 there are > whole > ranges of Unicode mappings that aren't in the UCM file, but they are in > the

Re: [PATCH] Encode::MIME::Header

2002-05-20 Thread Dan Kogai
On Monday, May 20, 2002, at 11:39 AM, Tatsuhiko Miyagawa wrote: > charsets can include "_" in its name. Here's a patch. Thanks, applied. With patches from Autrijus and you I think I now have enough diff to justify the version increment of Encode. Next version within 24 hours. Oh, VMS is st

Re: Acceptance of Unicode (UTF8) in Far East

2002-05-15 Thread Dan Kogai
On Thursday, May 16, 2002, at 03:04 AM, Mark Lewellen wrote: > Hi all- > I have a question directed mostly at those involved in the Far East. > Since Unicode is often implemented in UTF8, and UTF8 uses 3 bytes > for Chinese characters (instead of the 2 bytes in Chinese and Japanese > GB, Big5, J

Re: Encode::MIME::Header for perl 5.6.1

2002-05-15 Thread Dan Kogai
On Thursday, May 16, 2002, at 01:05 AM, Uri Cohen wrote: > I want to use Encode::MIME::Header but I only have perl 5.6.1 (from > ActiveState), which is the latest version for MSWindows. > Is it possible? I really need just this module. Well, you are welcome to port it if you really need it. So

Re: README.cjk?

2002-05-06 Thread Dan Kogai
Autrijus & Jungshik, the pumpking is calling for (hopefully) the last chance to update README.cjk. On Tuesday, May 7, 2002, at 02:48 , Jarkko Hietaniemi wrote: > Do I have the latest versions of the README.{cn,jp,ko,tw}? I do think so but I am calling for the last possible update anyhow. Hmm.

[Encode] 1.70 released

2002-05-06 Thread Dan Kogai
I should be relieved to release Encode-1.70 because djgpp problem should be resolved with this one, available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.70.tar.gz and CPAN Diff: 200 lines http://www.dan.co.jp/~dankogai/current-1.70.diff.gz And here are changes $

[Encode] encoding.pm made slightly more intuitive

2002-05-06 Thread Dan Kogai
On Monday, May 6, 2002, at 06:57 , Dan Kogai wrote: > On Monday, May 6, 2002, at 05:16 , Tatsuhiko Miyagawa wrote: >> panic happens while hacking with encoding pragma. > > It seems "use encoding" is still in effect after you 'use EncBar'. > Simply com

Re: [preannounce] Encode::Punycode

2002-05-06 Thread Dan Kogai
On Monday, May 6, 2002, at 07:11 , Tatsuhiko Miyagawa wrote: > I've just made Encode implementation for Punycode[1]. (Does it make > any sense to make such an encodings as subclass of Encode::Encoding? I > think it's reasonable, as there's Encode::MIME::Header!) I bet you do that sooner or later.

Re: use encoding in both scripts and modules

2002-05-06 Thread Dan Kogai
On Monday, May 6, 2002, at 07:01 , Tatsuhiko Miyagawa wrote: > yep, I know duplicate calls to "use encoding ..." is discouraged (as > written in docs). But is this "panic" thing as expected? That's a mystery that needs to be solved but PerlIO is very sensitive to memory issues and that may be it -- i

Re: [PATCH] Encode::Encoding

2002-05-06 Thread Dan Kogai
On Monday, May 6, 2002, at 06:51 , Tatsuhiko Miyagawa wrote: > > package Encode::MyEncoding; > use base qw(Encode::Encoding); > > __PACKAGE__->Define(qw(myCanonical myAlias)); > > dies saying: > >Error: Undefined subroutine &Encode::define_encoding called at ... > > Patch follows after
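The report above concerns defining an encoding by subclassing Encode::Encoding and calling Define. A minimal sketch of that pattern follows, assuming a current Encode; the package name Encode::MyRot13Sketch, the encoding names, and the ROT13 behavior are all hypothetical and chosen only for illustration. Encode is loaded explicitly first, which sidesteps the "Undefined subroutine &Encode::define_encoding" failure quoted in the message.

```perl
use strict;
use warnings;
use Encode ();    # load Encode first so Encode::define_encoding exists

# Hypothetical encoding class; name and ROT13 behavior are illustrative only.
package Encode::MyRot13Sketch {
    use parent qw(Encode::Encoding);

    # First name is the canonical name, the rest are aliases.
    __PACKAGE__->Define(qw(x-rot13-sketch x-rot13-sketch-alias));

    sub encode {
        my ($self, $str, $check) = @_;
        (my $out = $str) =~ tr/A-Za-z/N-ZA-Mn-za-m/;
        $_[1] = '' if $check;    # consume the source when CHECK is set
        return $out;
    }

    sub decode {
        my ($self, $octets, $check) = @_;
        (my $out = $octets) =~ tr/A-Za-z/N-ZA-Mn-za-m/;
        $_[1] = '' if $check;
        return $out;
    }
}

my $encoded = Encode::encode("x-rot13-sketch", "Hello");
my $decoded = Encode::decode("x-rot13-sketch", $encoded);
print "$encoded -> $decoded\n";
```

The encode and decode methods receive the encoding object, the data, and the CHECK value, which is the interface Encode::Encoding documents for subclasses.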

Re: use encoding in both scripts and modules

2002-05-06 Thread Dan Kogai
On Monday, May 6, 2002, at 05:16 , Tatsuhiko Miyagawa wrote: > panic happens while hacking with encoding pragma. It seems "use encoding" is still in effect after you 'use EncBar'. Simply commenting out 'use encoding 'euc-jp'' in encoding-test.pl makes the program work as expected. Dan

Re: Need help writing a UCM file

2002-05-05 Thread Dan Kogai
On Monday, May 6, 2002, at 11:34 , Andreas J. Koenig wrote: > Dan, > > I have to work with an encoding that is not round trip safe. It has > both trema and umlaut, and it has been decided to map both to U0308 > (COMBINING DIAERESIS). So I thought I'd write > > \xC8 |0 # (12/8) Trema >

[Encode] 1.69 Released

2002-05-04 Thread Dan Kogai
I hope it was not too premature to release Encode-1.69, now available as follows. Whole: http://www.dan.co.jp/~dankogai/Encode-1.69.tar.gz and CPAN Diff against current: 180 lines http://www.dan.co.jp/~dankogai/current-1.69.diff.gz And here are Changes $Revision: 1.69 $ $Date:

[Encode] 1.68 Released

2002-05-03 Thread Dan Kogai
I am delighted to add the first female to AUTHORS when I released Encode, available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.68.tar.gz Diff against current: 106 lines http://www.dan.co.jp/~dankogai/current-1.68.diff.gz Changes is just one paragraph long. $Revi

InLanguage properties? [Was Re: Encode-InCharset-0.01 Released]

2002-05-03 Thread Dan Kogai
On Friday, May 3, 2002, at 04:33 , Roman Vasicek wrote: >> On Friday, May 3, 2002, at 02:41 , Dan Kogai wrote: >> >> I have just released Encode-InCharset-0.01, available as >> >> http://www.dan.co.jp/~dankogai/Encode-InCharset-0.01.tar.gz and CPAN. >> >&

Encode-InCharset-0.01 Released

2002-05-02 Thread Dan Kogai
e, the perlunicode manpage AUTHOR Dan Kogai <[EMAIL PROTECTED]> COPYRIGHT AND LICENSE Copyright 2002 by Dan Kogai This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See http://www.perl.com/perl/misc/Artistic.html

[Patch] User-defined \p{} more like Camel 3 example

2002-05-02 Thread Dan Kogai
jhi, I've submitted this yesterday but it seems it gets simply overlooked (got no positive or negative response) so here we go again. Dan On Thursday, May 2, 2002, at 12:44 , Dan Kogai wrote: > On Wednesday, May 1, 2002, at 11:23 , Jarkko Hietaniemi wrote: >> perlunicode.pod and

[Encode] 1.67 released

2002-05-02 Thread Dan Kogai
I wonder how Laszlo's doing when I released Encode 1.67, available as follows. Whole: http://www.dan.co.jp/~dankogai/Encode-1.67.tar.gz and CPAN Diff against current: 147 lines http://www.dan.co.jp/~dankogai/current-1.67.diff.gz And Changes. As you see changes are just cosmeti

Re: Change 16308: Encode tweak from Dan Kogai.

2002-05-01 Thread Dan Kogai
On Thursday, May 2, 2002, at 03:03 , Philip Newton wrote: > On Wed, 1 May 2002 09:45:05 -0700, [EMAIL PROTECTED] (Jarkko Hietaniemi) wrote: > >> if (check & ENCODE_DIE_ON_ERR) { >> Perl_croak( >> -aTHX_ "\"\\N{U+%" UVxf "}\" does not map to %s", >>

Re: Encode, charnames and utf8heavy

2002-05-01 Thread Dan Kogai
On Wednesday, May 1, 2002, at 11:23 , Jarkko Hietaniemi wrote: > perlunicode.pod and "User-defined Character Properties" already > documents it. I guess accepting \s+ is okay... but as I said, > people shouldn't be doing that by hand (much). And here is the patch that fixes this. [ \t]+ is pick
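For readers unfamiliar with the format being patched, here is a minimal sketch of a user-defined character property as described in the "User-defined Character Properties" section of perlunicode. The property name InKanaSketch and its ranges are assumptions made for illustration, not taken from the patch. Each line gives a start and end code point in hex, separated here by a tab.

```perl
use strict;
use warnings;

# Hypothetical property for illustration: the Hiragana and Katakana blocks.
# Each line is "START<whitespace>END" in hex, terminated by a newline.
sub InKanaSketch {
    return join '',
        "3040\t309F\n",    # Hiragana
        "30A0\t30FF\n";    # Katakana
}

my $hiragana_a = "\x{3042}";    # HIRAGANA LETTER A
my $latin_a    = "A";

my $kana_matches  = $hiragana_a =~ /\p{InKanaSketch}/ ? 1 : 0;
my $latin_matches = $latin_a    =~ /\p{InKanaSketch}/ ? 1 : 0;

print "kana: $kana_matches, latin: $latin_matches\n";
```

The property name must begin with "In" or "Is" for Perl to look up the subroutine.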

[Patch] ext/PerlIO/t/fallback.t gets haircut

2002-05-01 Thread Dan Kogai
jhi, > A bit of noise from ext/PerlIO/t/fallback.t: > > ./perl -Ilib ext/PerlIO/t/fallback.t > 1..8 > ok 1 - opened iso-8859-1 file > "\N{U+20ac}" does not map to iso-8859-1 at ext/PerlIO/t/fallback.t line > 21. > ok 2 - perlqq escapes > ok 3 - opened iso-8859-1 file > ok 4 - HTML escapes > ok 5

Re: Encode, charnames and utf8heavy

2002-05-01 Thread Dan Kogai
On Wednesday, May 1, 2002, at 11:04 , Jarkko Hietaniemi wrote: > Yes, it is. It's hack. (Regexps and a small cache. It *really* sucked > without that cache...) Oh yes. I had to say I almost got a hangover :P > (And I just remembered that viacode() returning an undef when there's > no corresp

Re: Encode, charnames and utf8heavy

2002-05-01 Thread Dan Kogai
On Wednesday, May 1, 2002, at 10:57 , Dan Kogai wrote: > Okay, I'll change the error message in the next one so it would say > > "\x{abcd}" does not map to iso-8859-1 at ext/PerlIO/t/fallback.t line > 21. > > Autrijus just sent me a patch so it won't t

Encode, charnames and utf8heavy

2002-05-01 Thread Dan Kogai
On Wednesday, May 1, 2002, at 10:30 , Jarkko Hietaniemi wrote: > Thanks, upgraded. > > A bit of noise from ext/PerlIO/t/fallback.t: > > ./perl -Ilib ext/PerlIO/t/fallback.t > 1..8 > ok 1 - opened iso-8859-1 file > "\N{U+20ac}" does not map to iso-8859-1 at ext/PerlIO/t/fallback.t line > 21. > ok

Re: [PATCH] Let Guess.pm handles uninitialized argument.

2002-05-01 Thread Dan Kogai
On Wednesday, May 1, 2002, at 09:19 , Autrijus Tang wrote: > This way is self-descriptory; it makes -w happier. :) > > /Autrijus/ XieXie. Applied. Dan the Encode Maintainer

[Encode] 1.66 Released

2002-04-30 Thread Dan Kogai
My fever is down at last when I released Encode-1.66, available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.66.tar.gz or CPAN Diff against current: 264 lines http://www.dan.co.jp/~dankogai/current-1.66.diff.gz And Changes. $Revision: 1.66 $ $Date: 2002/05/01 05:4

Re: [Encode] 1.65 released

2002-04-30 Thread Dan Kogai
On Wednesday, May 1, 2002, at 02:15 , Nick Ing-Simmons wrote: >> $Revision: 1.65 $ $Date: 2002/04/30 16:13:37 $ >> ! Encode.pm >> encode(undef) no longer warns for C> subroutine entry>. Suggested by Paul. > > Can I get warnings + fallbacks yet? Er, which one are you talking about? Hold on.

Encode should stay undefphobia

2002-04-30 Thread Dan Kogai
On Wednesday, May 1, 2002, at 02:10 , Nick Ing-Simmons wrote: > Dan Kogai <[EMAIL PROTECTED]> writes: > > Please don't. > > $a =~ tr/A/a/; > > gives a warning so should encode/decode. How can I be so dumb for not anticipating you say that! (Blame it on the fever)

[Encode] 1.65 released

2002-04-30 Thread Dan Kogai
I am wondering if Perl 5.8.0 will be integrated to FreeBSD 5 when I released Encode-1.65. FYI FreeBSD 5 release schedule is delayed a whole year from November 2001 to November 2002. We would definitely have Perl 5.8.0 with no trailing RC (right, Borg?) Whole: http://www.dan.co.jp/~da

Re: Encode doesn't like undef

2002-04-30 Thread Dan Kogai
On Tuesday, April 30, 2002, at 11:42 , Paul Marquess wrote: > I agree that passing undef() to one of the encoding functions may be an > edge > condition too far, but passing a variable that contains undef is more > common. > > $ perl5.7.3 -w -MEncode -e 'Encode::encode_utf8($a)' > Name "main::a"

Re: Encode doesn't like undef

2002-04-30 Thread Dan Kogai
On Tuesday, April 30, 2002, at 07:14 , Paul Marquess wrote: > This is with Encode 1.64 > > $ perl5.7.3 -w -MEncode -e 'Encode::encode_utf8(undef)' > Use of uninitialized value in subroutine entry at > /tmp/bleed/lib/perl5/5.7.3/sun4-solaris/Encode.pm line 183. > > I don't know Encode well enough t

[Encode] Encode-JIS2K-0.01 uploaded to CPAN

2002-04-30 Thread Dan Kogai
JIS X (0201-K|0213) Decode Only (|3) those only found in 0213 those only found in 0212 ------ AUTHORS Dan Kogai <[EMAIL PROTECTED]> COPYRIGHT Copyright 2002

Re: [Encode] euc-jp vs euc-jisx0213

2002-04-30 Thread Dan Kogai
On Monday, April 29, 2002, at 07:38 , SADAHIRO Tomoyuki wrote: > I doubt whether users of 'euc-jp' will > assume it to be a combination with JIS X 0213. They don't have to because 'euc-jp' behaves exactly the same as before so long as the charset is in ASCII/JISX(0201|0208|0212). > Such a mixin

[Encode] 1.64 Released

2002-04-29 Thread Dan Kogai
I am recovering from cold when I released Encode-1.64, available as follows. Whole: http://www.dan.co.jp/~dankogai/Encode-1.64.tar.gz and CPAN Diff against current: 16529 lines (because of new euc-jp.ucm) http://www.dan.co.jp/~dankogai/current-1.64.diff.gz Though no piece of code

[Encode] euc-jp vs euc-jisx0213

2002-04-28 Thread Dan Kogai
Sadahiro-san and perl-unicode readers, I am now working on Encode::JIS2K, an additional converter for JIS X 0213:2000. When I studied JIS X 0213, I found that for euc-jp, you can make a map so that it covers both JIS X 0212 and JIS X 0213. I thought they were mutually exclusive but they were

[Encode] 1.62 released

2002-04-27 Thread Dan Kogai
I can feel 5.8 is near when I released Encode 1.62, available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.62.tar.gz Diff Against Current: (Just 111 lines) http://www.dan.co.jp/~dankogai/current-1.62.diff.gz $Revision: 1.62 $ $Date: 2002/04/27 11:17:39 $ ! Encode.p

http://bleedperl.dan.co.jp:8080/

2002-04-27 Thread Dan Kogai
I have set up an experimental mod_bleedperl server which URI is shown in the subject. To demonstrate the power of Perl 5.8, I have written a small cgi/pl (.pl runs on Apache::Registry) called piconv.pl, a web version of piconv(1). http://bleedperl.dan.co.jp:8080/piconv/ (Don't forget :8080; it'

Unicode::Unihan 0.01 uploaded to CPAN

2002-04-26 Thread Dan Kogai
ny of such, tell me and I'll remove it from CPAN. Dan the Open Source Developer -- _ Dan Kogai __/ CEO, DAN co. ltd. /__ /-+-/ 2-8-14-418 Shiomi Koto-ku Tokyo 135-0052 Japan /--/--- mailto: [EMAIL PROTECTED] / http://www.dan.co.jp/ - __/ /Tel:+81 3-5665-6131

[Encode] 1.61 released

2002-04-25 Thread Dan Kogai
I know we are one more step closer to 5.8 when I released Encode 1.61, available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.61.tar.gz and CPAN Diff against current (840 lines) http://www.dan.co.jp/~dankogai/current-1.61.diff.gz And changes. $Revision: 1.

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 10:07 , Bart Schuller wrote: > On Wed, Apr 24, 2002 at 09:56:29PM +0900, Dan Kogai wrote: >> Shoot! I've just implemented FB_HTMLENT ! (quick, wasn't it?) Okay, >> be >> it CHARREF (or isn't there a good short abbreviation fo

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 09:43 , Bart Schuller wrote: > Character Reference is the proper term, for entities you'd need my whole > module. > Please go completely overboard and have FB_XMLCHARREF in addition to > FB_HTMLCHARREF, the difference being that the XML version would make it > € S
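The fallback modes discussed in this thread can be sketched as follows. In Encode as it ships today the constants are named FB_HTMLCREF (decimal references) and FB_XMLCREF (hexadecimal references); the names floated in the messages above were proposals at the time. The sample string is an assumption for illustration.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $text = "price: \x{20AC}5";    # EURO SIGN has no mapping in ISO-8859-1

# LEAVE_SRC keeps $text intact; with a CHECK value and no LEAVE_SRC,
# encode() is allowed to modify its source argument.
my $html = encode("iso-8859-1", $text, Encode::FB_HTMLCREF | Encode::LEAVE_SRC);
my $xml  = encode("iso-8859-1", $text, Encode::FB_XMLCREF  | Encode::LEAVE_SRC);

print "$html\n";    # unmappable character as a decimal character reference
print "$xml\n";     # unmappable character as a hexadecimal character reference
```

Characters that do map to the target encoding pass through unchanged; only the unmappable ones become character references.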

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 09:25 , Bart Schuller wrote: > Hello, > > The cool Encoding support in 5.8 to be enables me to properly solve a > very common task: making HTML entities out of utf-8 data. > > I generated a ucm file with entries like this: > > \x26\x6E\x62\x73\x70\x3B

FYI: Encode performance on Japanese encodings

2002-04-23 Thread Dan Kogai
I was curious to find how fast or slow Encode is against popular Japanese transcoder modules. So I benchmarked and was relieved to find that Encode's performance was good! I benchmarked it against Jcode.pm (mine, too) and jcode.pl (the first and still popular transcoder available since Perl4 by Utashiro-

Re: [Encode] new README.cjk available

2002-04-22 Thread Dan Kogai
On Sunday, April 21, 2002, at 11:38 , Shigeki Moro wrote: >> I think "ディーヴァナガーリ" should be "デーヴァナーガリー". > > I'm sorry that was too short. This is a correction for /00README.jp/. First my apology for slow response; I was too focused on Encode-PerlIO issue. Now that

[Encode] 1.58 Released

2002-04-22 Thread Dan Kogai
I am sorry to release Encode-1.58 after less than 24 hours of the release of 1.57. Available at http://www.dan.co.jp/~dankogai/Encode-1.58.tar.gz and CPAN No code has changed. Even for documentation, only one typo in TW/TW.pm was fixed. Then why 1.58? t/* has undergone majo

[Encode] 1.57 Released

2002-04-22 Thread Dan Kogai
I am still struggling to get a life when I released Encode-1.57, available as follows. Whole: http://www.dan.co.jp/~dankogai/Encode-1.57.tar.gz Diff (380 lines) http://www.dan.co.jp/~dankogai/current-1.57.diff.gz This is mainly to address "smoke when -Uuseperlio". On Monday, Ap

Re: Encode-1.56 and PerlIO-encoding 0.04 available

2002-04-22 Thread Dan Kogai
On Monday, April 22, 2002, at 07:33 , Nick Ing-Simmons wrote: > Dan Kogai <[EMAIL PROTECTED]> writes: >> >> There are two more encodings that don't work well w/ PerlIO. >> ISO-2022-KR and HZ. But neither is used ubiquitously (at least to the >> exten

Encode-1.56 and PerlIO-encoding 0.04 available

2002-04-22 Thread Dan Kogai
Porters, I am hoping to get a life with new Encode-1.56 and PerlIO-encoding 0.04. Available as follows; Encode http://www.dan.co.jp/~dankogai/Encode-1.56.tar.gz and CPAN PerlIO::encoding http://www.dan.co.jp/~dankogai/PerlIO-encoding-0.04.tar.gz The biggest news

Re: [BIG PATCH] Encode docs

2002-04-21 Thread Dan Kogai
On Monday, April 22, 2002, at 06:20 , Philip Newton wrote: > Hi Dan & Jarkko, > > I had a look through the Encode docs and made a bunch of tweaks. Here's > a diff, first as straight text and also as a gzipped attachment (in case > stuff such as line wrapping or tabs get broken). > > I based the pa

Re: [big5-*.ucm] please revise if possible

2002-04-20 Thread Dan Kogai
On Sunday, April 21, 2002, at 02:32 , Autrijus Tang wrote: > Updated maps and test: > http://egb.elixus.org/~autrijus/big5-1.52.tgz > > Ucmlint still complains, due to the order issue outlined in the > previous mail. As you have intelligently found, the order for duplicate map DOES matter; |1 or

Re: Encode-1.50 +

2002-04-20 Thread Dan Kogai
On Sunday, April 21, 2002, at 04:50 , Nick Ing-Simmons wrote: > I just checked in these changes to ext/Encode/... as change 16022 > on perlio branch. To honor whitespaces, I usually rsync perl-core first then copy files back to my repository for NI-XS (this works only for patches from those w/

[big5-*.ucm] please revise if possible

2002-04-20 Thread Dan Kogai
On Saturday, April 20, 2002, at 05:21 , Autrijus Tang wrote: > On Sat, Apr 20, 2002 at 08:00:04AM +0900, Dan Kogai wrote: >> Is this okay? I think this is due to the edition difference. If so, >> please submit a fixed version of TW.t > > Yes, as attached, thanks. HenBao!

[Encode] 1.51 Released

2002-04-20 Thread Dan Kogai
I was anticipating the release of 1.51 AFTER I get to bed and back. But my insomnia and earlier-than-expected responses from NI-XS and Autrijus have accelerated the release by at least 6 hours :) Get it via http://www.dan.co.jp/~dankogai/Encode-1.51.tar.gz or CPAN. Though changes are small c

Re: Tk804 + Encode-1.50 :-) again

2002-04-19 Thread Dan Kogai
By the time you read this you must have just awoken. On Saturday, April 20, 2002, at 07:07 , Nick Ing-Simmons wrote: > Near line 181 of Encode.xs (tail of main while loop): > > /* settle variables when fallback */ > dlen = SvCUR(dst); > d = (U8*)SvPVX(dst) + dlen; >

Re: [PATCH] Big5-related changes.

2002-04-19 Thread Dan Kogai
Autrijus, I now found that your new *.ucm smokes on t/TW.t > 1..17 > ok 1 - use Encode::TW; > ok 2 - [big5] decode - Basic Big5 range > ok 3 - [big5] encode - Basic Big5 range > ok 4 - [big5] from_to => utf8 - Basic Big5 range > ok 5 - [big5] utf8 => from_to - Basic Big5 range > ok 6 - [big5-hks

[Encode] Dark Side of the Emacs Modes [Was: Re: Tk804 ...]

2002-04-19 Thread Dan Kogai
On Saturday, April 20, 2002, at 05:38 , Nicholas Clark wrote: > On Sat, Apr 20, 2002 at 04:27:15AM +0900, Dan Kogai wrote: >> Yes, please. Emacs doesn't do spellcheck-as-you-type like recent >> mailers in MacOS and Windows :) (I know you can spellcheck in Emacs >> but

Re: [PATCH] Big5-related changes.

2002-04-19 Thread Dan Kogai
On Saturday, April 20, 2002, at 04:53 , Autrijus Tang wrote: > I've been immersed in Big5-related issues in the past few days, and > came back with these last-minute (err, week?) changes before 5.8-RC1. > > The Diff contains fixes to TW.pm, Alias.pm, and README.(tw|cn). Excellent! > (For dan) bi

Re: Tk804 + Encode-1.50 :-) again

2002-04-19 Thread Dan Kogai
On Saturday, April 20, 2002, at 03:45 , Nick Ing-Simmons wrote: > Dan Kogai <[EMAIL PROTECTED]> writes: >> I am daydreaming that I am a caravan member, driving a herd of >> disobedient camels on the never-ending desert to an oasis called 5.8.0 >> when I released new

Encode-1.50 and PerlIO::encoding 0.02 released

2002-04-18 Thread Dan Kogai
I am daydreaming that I am a caravan member, driving a herd of disobedient camels on the never-ending desert to an oasis called 5.8.0 when I released new Encode and PerlIO::encoding. You can get one as follows. Whole: Encode http://www.dan.co.jp/~dankogai/Encode-1.50.t

[Encode] new CHECK specifications

2002-04-18 Thread Dan Kogai
On Friday, April 19, 2002, at 05:01 , Nick Ing-Simmons wrote: > I am not sure when the change went in, but current Encode.xs > has broken Tk804. Ouch. > With $encoding->decode($string,1) > > now croaks if character does not map. Croaking is fine as a default > for checking but Tk would like a va
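The difference between a croaking and a non-croaking CHECK value, which is what this message asks for, can be sketched as below. This uses the constants in Encode as it exists today (FB_DEFAULT, FB_CROAK, LEAVE_SRC), which may not match the specification being drafted in this thread; the sample bytes are an assumption for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode);

my $bytes = "caf\xE9";    # 0xE9 is not a valid ASCII byte

# FB_DEFAULT: the unmappable byte is replaced with U+FFFD and decoding continues.
my $lenient = decode("ascii", $bytes, Encode::FB_DEFAULT);

# FB_CROAK: die at the first byte that does not map; LEAVE_SRC protects $bytes.
my $strict = eval { decode("ascii", $bytes, Encode::FB_CROAK | Encode::LEAVE_SRC) };
my $error  = $@;

printf "lenient result ends with U+%04X\n", ord substr($lenient, -1);
print  "strict decode ", (defined $strict ? "succeeded" : "croaked"), "\n";
```

A caller such as a GUI toolkit that wants to keep going can use the lenient form, while a validator can wrap the strict form in eval and inspect the error.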

Re: HanExtra 0.04 up, and patchs

2002-04-18 Thread Dan Kogai
On Thursday, April 18, 2002, at 11:30 , Autrijus Tang wrote: > CCCII is introduced to HanExtra as of 0.04. Some patch to Encode here: > > * Removed CJKGuide reference. > * Removed CNS11643 reference. > * Added CCCII reference. > * Tag cleanup. Xiexie. Applied. Dan

Re: Please update Encode::HanExtra

2002-04-18 Thread Dan Kogai
On Thursday, April 18, 2002, at 04:40 , Autrijus Tang wrote: > On Thu, Apr 18, 2002 at 11:41:48AM +0900, Dan Kogai wrote: >> http://www.dan.co.jp/~dankogai/Encode-HanExtra-0.04.tar.gz >> Please pick it up, add necessary changes and upload YOUR version to >> CPAN. > >

Please update Encode::HanExtra

2002-04-17 Thread Dan Kogai
Autrijus, Now that enc2xs is standard in bleedperl and perl 5.8.0, you should update Encode::HanExtra at CPAN so that it uses enc2xs instead of old compile script (and more importantly, encode.h) I've made an updated version at http://www.dan.co.jp/~dankogai/Encode-HanExtra-0.04.tar.gz

[Encode] new README.cjk available

2002-04-17 Thread Dan Kogai
At http://www.dan.co.jp/~dankogai/bleedperl/ On Thursday, April 18, 2002, at 12:47 , Jarkko Hietaniemi wrote: > - Encode partial encoding issues [Dan, NI-S] > - Encode DJGPP failures [Dan] > - README.cjk updates [Dan] does this make this list one shorter? As for README.jp, I've left I18N part

Encode-1.42 & PerlIO-encoding-0.01 now available

2002-04-16 Thread Dan Kogai
NI-XS, jhi and porters, The surgical operation is finished. PerlIO layer functions in Encode.xs has been successfully detached. Now PerlIO part is in PerlIO::encoding. They are now more like interdependent than dependent. You can get one via URLs below; http://www.dan.co.jp/~dankogai/Perl

[Encode] 1.41 released

2002-04-16 Thread Dan Kogai
day, April 17, 2002, at 08:25 , Jarkko Hietaniemi wrote: >> On Wed, Apr 17, 2002 at 07:49:19AM +0900, Dan Kogai wrote: >> I will go ahead w/ the plan. I will release the next version with >> PerlIO part untouched to let us sync. Then the following version will >> detach the Pe

[Encode] All README.xx available at...

2002-04-15 Thread Dan Kogai
uys think. On Tuesday, April 16, 2002, at 08:44 , Jarkko Hietaniemi wrote: > On Tue, Apr 16, 2002 at 08:40:02AM +0900, Dan Kogai wrote: >> On Tuesday, April 16, 2002, at 08:14 , Jarkko Hietaniemi wrote: >>> Could I ask for the Japanese translation? (Check out Autrijus' lat

Re: README.jp (or README.jp?)

2002-04-15 Thread Dan Kogai
On Tuesday, April 16, 2002, at 08:14 , Jarkko Hietaniemi wrote: > Could I ask for the Japanese translation? (Check out Autrijus' latest > message about the subject, they had a useful additional section.) Sorry. I was too preoccupied w/ the module itself. Will be submitted before I go to bed.

Re: iso-2022-jp problem

2002-04-15 Thread Dan Kogai
On Tuesday, April 16, 2002, at 01:06 , Nick Ing-Simmons wrote: > So we need some way of telling from an encoding object (e.g. > an attribute or a method call) that it needs line buffering > so that :encoding layer can take the appropriate steps. Okay, which way do you like, attribute or method ?

Re: iso-2022-jp problem

2002-04-15 Thread Dan Kogai
On Tuesday, April 16, 2002, at 12:00 , Nick Ing-Simmons wrote: >> abcdcdefghijklmnopqrstu >> ^^error occurs here. >> >> What's the remaining stream? >> >> ghijklmnopqrstu > > Does not matter for that case. > "does not map" is a fatal error with $chk true (and would

Re: iso-2022-jp problem

2002-04-15 Thread Dan Kogai
On Monday, April 15, 2002, at 07:29 , Nick Ing-Simmons wrote: > I tracked down the "problem" tkmail was/is having with iso-2022-jp. > The snag is I am using the API the way I designed it, not the way > it is reliably implemented. > > When called thus: > > my $decoded = $enc->decode($encoded,1); >

[Encode] 1.40 released!

2002-04-14 Thread Dan Kogai
On Monday, April 15, 2002, at 07:00 , Dan Kogai wrote: > Folks, > > I will release ver. 1.40 of Encode after the smoke testings are > done. With In-XSimmons' XS version of Unicode transcoders, > encoding.pm enhancements and fixes (that led to "child gets cr

[Encode] 1.40 will be released in a few hours!

2002-04-14 Thread Dan Kogai
Folks, I will release ver. 1.40 of Encode after the smoke testings are done. With In-XSimmons' XS version of Unicode transcoders, encoding.pm enhancements and fixes (that led to "child gets croaked before born" bug discovery), and other nits picked, simple version increment is not enough

Re: README.jp, README.tw, README.cn, README.kr

2002-04-13 Thread Dan Kogai
On Sunday, April 14, 2002, at 05:38 , Sean M. Burke wrote: > At 23:30 2002-04-13 +0300, Jarkko Hietaniemi wrote: >> (You know what? Since of the files will be named README.xx and >> written in pod, the build machinery will automatically create >> the pod pages "perljp", "perltw", "perlcn", and "p

Re: README.jp, README.tw, README.cn, README.kr

2002-04-13 Thread Dan Kogai
On Sunday, April 14, 2002, at 05:15 , Jarkko Hietaniemi wrote: >> Should we mention piconv? Or this: > > piconv is good... It's going to be better on 1.35 -- I have noticed piconv can't use encodings that are added AFTER perl or Encode is installed (because Encode::Config does not tweak %Ext

[Encode] 1.34 released only as diff

2002-04-12 Thread Dan Kogai
On Saturday, April 13, 2002, at 05:41 , Jarkko Hietaniemi wrote: > Could you rsync from AS and send me the diff? I've uploaded it as http://www.dan.co.jp/~dankogai/current-1.34.diff.gz Well, it was just 161 lines in total so I could've pasted here but for the sake of whitespaces use the one ab

Re: iso-2022-jp snags.

2002-04-11 Thread Dan Kogai
On Friday, April 12, 2002, at 02:30 , Nick Ing-Simmons wrote: > Having hacked RFC2047 support into tkmail I have now seen some > non-latin1 characters in a "real" perl/Tk app. > > There seem to be a few snags with mime's iso-2022-jp: > > - It failed to demand load given upper-case form ISO-2022-JP

Re: [PATCH] Encoder.pm

2002-04-11 Thread Dan Kogai
On Thursday, April 11, 2002, at 12:49 , Tatsuhiko Miyagawa wrote: > - Encoder->new($data)->iso_8859_1; # OOP way > + Encode::Encoder->new($data)->iso_8859_1; # OOP way Ooops. Thanks. Applied. Dan

[Encode] 1.33 released -- minuscule changes

2002-04-10 Thread Dan Kogai
I've got a feeling 5.8.0 will be a reality, not something like a horizon which is always there in front of you but you can never reach, when I release ver. 1.33 of Encode. Available as follows; Whole: http://www.dan.co.jp/~dankogai/Encode-1.33.tar.gz and CPAN Diff against curre

Re: My email address in the Encode AUTHORS file

2002-04-10 Thread Dan Kogai
On Thursday, April 11, 2002, at 04:30 , Philip Newton wrote: > Can you please change my email address to '[EMAIL PROTECTED]', please? > Thanks! > (That's also what's in the main Perl AUTHORS file.) Sure. Fixed. Dan

[Encode] Your patch applied

2002-04-10 Thread Dan Kogai
Anton, > - several typos > - excludes GBK from that section because > it is discussed in Microsoft-related > > Just routine. Thank you. Applied. Since this is purely of documentation, chances are it will go with 5.8.0-RC2 at least and should there be any code change BEFORE RC1, 5.8.0-RC1.

[Encode] to make -Uusecjk

2002-04-09 Thread Dan Kogai
On Wednesday, April 10, 2002, at 06:01 , Jarkko Hietaniemi wrote: > Yes, something like that could be easier to implement. Maybe > installperl could do the appropriate magic (i.e. skip the CJK)? > For static builds Configure needs tweaking, I think. Though I am still not sure of the irrelevance,

[Encode] (ir)?relevance of Configure -Uusecjk

2002-04-09 Thread Dan Kogai
On Wednesday, April 10, 2002, at 05:34 , Jarkko Hietaniemi wrote: > I'm afraid I've got one change at least coming up, I want to implement > the Configure -Uusecjk, so people with memory/disk footprint worries > can leave out the CJK of Encode. As soon as I can figure out how > to convince MM not

[Encode] 1.32 released

2002-04-09 Thread Dan Kogai
I am longing for the day when 5.8.0-RC1 be released when I uploaded version 1.32 of Encode. Get one via http://www.dan.co.jp/~dankogai/Encode-1.32.tar.gz Or CPAN. diff against current is also available as http://www.dan.co.jp/~dankogai/current-1.32.diff.gz And here are Changes. I would app

Re: an api

2002-04-09 Thread Dan Kogai
On Wednesday, April 10, 2002, at 03:58 , Jarkko Hietaniemi wrote: > > Encode::Alias::resolve_alias(), to go with define_alias()? Or cname() :? And of course, Encode->() is for UTF-128 :P Well, be it resolve_alias(). Dan the Network Consultant by Trade
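The name settled on here, resolve_alias(), is available in Encode today. A minimal sketch of the "give me the canonical name" use case raised in this thread follows; the unknown name is a made-up string for illustration.

```perl
use strict;
use warnings;
use Encode ();

# Map a possibly aliased encoding name to its canonical name.
my $canonical = Encode::resolve_alias("latin1");

# A name that matches no encoding yields a false value.
my $unknown = Encode::resolve_alias("no-such-encoding-xyz");

print "latin1 => $canonical\n";
print "unknown name resolves: ", ($unknown ? "yes" : "no"), "\n";
```

Because the return value is false for unknown names, the function doubles as a cheap existence check before calling encode() or decode().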

Re: Encode failures when used as a static extension

2002-04-09 Thread Dan Kogai
Andy, On Wednesday, April 10, 2002, at 02:44 , Andy Dougherty wrote: > I occasionally use the following config.over file for testing: > > # config.over -- copy this to your perl source directory > static_ext=$dynamic_ext > dynamic_ext='' > > With it, I get the following results

Re: an api

2002-04-09 Thread Dan Kogai
On Wednesday, April 10, 2002, at 03:46 , Jarkko Hietaniemi wrote: > Is there a simple user-level API for > > "I have a string that might be an encoding name, >give me the canonical name" > > e.g. > > Encode::foobar('latin1') eq 'iso8859-1' > > If so (and I think there should be

[Encode] Memory Allocator bug in Encode.xs found and fixed

2002-04-09 Thread Dan Kogai
Porters, I have found a bug in Encode.xs that may cause "Out of Memory!" error in certain rare conditions. That is, when the memory allocator routine is called with no string processed, it asks perl for bogus (and random) amount of memory via SvGROW. I found this by accident when I was

[Encode] 3 patches

2002-04-09 Thread Dan Kogai
On Tuesday, April 9, 2002, at 10:39 , Nick Ing-Simmons wrote: > I _think_ that gets me back to where I was. Now I can see if I can get > jis0208 to work ... Thank you and your patches are applied flawlessly. > --- lib/Encode/Unicode.pm.shipTue Apr 9 14:28:13 2002 > +++ lib/Encode/Unicod

Re: [PATCH]s and questions [Encode] 1.30

2002-04-08 Thread Dan Kogai
On Tuesday, April 9, 2002, at 04:33 , Philip Newton wrote: > > This bit appears not to have been applied? > > Here it is again, together with another few tweaks to Encode::Unicode. Yikes. Too late for 1.31 but applied. Patch failed in two places but it was trivial to manually roll it back. T

[Encode] 1.31 released

2002-04-08 Thread Dan Kogai
On Tuesday, April 9, 2002, at 02:38 , Dan Kogai wrote: > Promised changes in 1.31 is right after my sig. I am grateful to release ver. 1.31 of Encode as promised, available http://www.dan.co.jp/~dankogai/Encode-1.31.tar.gz And CPAN. The diff against perl-current is just 707 li

[Encode] 1.31 in a few hours

2002-04-08 Thread Dan Kogai
On Tuesday, April 9, 2002, at 02:04 , Anton Tagunov wrote: > Other items in my '[PATCH]s and questions [Encode] 1.30' mail were: > > - a cosmetic patch to Supported.pod This one must be the most acceptable by pumpkins since it has no piece of code :) > - a question whether > > Encoder.pm >

[Encode 1.30] Patch to correct BOM value for 32LE

2002-04-08 Thread Dan Kogai
jhi, The following patch will correct incorrect value for BOM for 32LE. The first one is essentially identical to that of Anton. And the second will fix t/Unicode.t so it is more independent of Encode::Unicode (that is, should there be an error there t/Unicode.t will find it -- currently

[Encode 1.30] BOM32LE was incorrect - fixed

2002-04-08 Thread Dan Kogai
Anton, On Monday, April 8, 2002, at 10:05 , Anton Tagunov wrote: > --- ext/Encode-1.30/lib/Encode/Unicode.pm.orig Mon Apr 8 14:06:28 2002 > +++ ext/Encode-1.30/lib/Encode/Unicode.pm Mon Apr 8 17:00:47 2002 > @@ -12,7 +12,7 @@ > sub FBCHAR(){ 0xFFFd } > sub BOM_BE(){ 0xFeFF } > sub BOM
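As background for the BOM fix, the byte order mark is U+FEFF serialized in the byte order of the stream. The sketch below does not reproduce the patched constants, which are cut off in the snippet above; it only shows, with current Encode, what U+FEFF looks like in the two UTF-32 byte orders.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $bom = "\x{FEFF}";    # the code point used as the byte order mark

# Serialize U+FEFF in each byte order and show the raw bytes as hex.
my $le_hex = unpack "H*", encode("UTF-32LE", $bom);
my $be_hex = unpack "H*", encode("UTF-32BE", $bom);

print "UTF-32LE: $le_hex\n";
print "UTF-32BE: $be_hex\n";
```

A reader that sees the little-endian byte sequence at the start of a UTF-32 stream knows to interpret the rest of the stream as little-endian.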

[Encode] 1.30 released

2002-04-07 Thread Dan Kogai
I am too sleepy to abstain from releasing ver. 1.30 of Encode, available at http://www.dan.co.jp/~dankogai/Encode-1.30.tar.gz as well as CPAN. Since the diff against perl-current was only less than 700 lines, patch was made available as http://www.dan.co.jp/~dankogai/current-1.30-diff.gz Th

Re: A modest patch [Encode] 1.26

2002-04-07 Thread Dan Kogai
On Monday, April 8, 2002, at 08:33 , Anton Tagunov wrote: > Hello, Dan! > > Very modest: typos, C<>, wording, uhc, x-windows-949, Windows-31J > > /Anton/ Thanks. Applied. Will appear in the next release. Dan

[Encode] 1.26 Released

2002-04-07 Thread Dan Kogai
jhi and porters, I *must be* relieved to release version 1.26 of Encode. Available at http://www.dan.co.jp/~dankogai/Encode-1.26.tar.gz And CPAN. =h1 major changes * All (UCS-2|UTF-(16|32)(BE|LE)? are now supported. Will we support UTF-7 :? * jis02(01|08|12).ucm are back to m

[Encode] benchmark: UTF-16BE encoding/decoding

2002-04-07 Thread Dan Kogai
Good morning. I know NI-S is very concerned with the performance of UCS-2 (or UTF-16BE for the latest Java; It maps \x{1} or higher as SOB, oops, surrogate pair) so I decided to benchmark some. My latest (yet to be uploaded) Encode::Unicode includes two implementations. *_classic is
