Alert: Encode alias implementation is buggy

2002-03-20 Thread Dan Kogai
Encode Hackers, I have found a big bug in alias implementation; you can't override alias later on! Suppose there is a user who wants to use vendor encoding. S/he may Encode::define_alias( qr/sjis$/i => '"cp932"'); # Windoze or die! But you will still get shiftjis. I check

Re: Fwd: [PATCH][docs] Encode.pm

2002-03-20 Thread Markus Kuhn
Just for the record: The correct way to spell an ISO standard is to have a space between ISO and its number. Correct: ISO 8859-1. Other spellings such as ISO-8859-1 or ISO_8859-1 are merely workarounds for situations, where spaces are inconvenient. That's why MIME uses ISO-8859-1. In documentatio

Re: Fwd: GB 1833 name abiguous

2002-03-20 Thread Autrijus Tang
On Thu, Mar 21, 2002 at 08:05:21AM +0800, Autrijus Tang wrote: > in which I proposed to rename gb2312 to gb2312-raw to avoid the > ambiguity. The *other* 8-bit MIME encoding is euc-cn. > > And yes, this is in disagreement to iconv and hc's conventions, in > which gb2312 is an alias to euc-cn,

Re: Fwd: GB 1833 name abiguous

2002-03-20 Thread Autrijus Tang
On Wed, Mar 20, 2002 at 05:50:42PM +0300, Anton Tagunov wrote: > What we have is an ambigeous name. If we were meaning the 8 bit > encoding (that has a MIME name) it should have been GB2312, not > GB 2312. This has been raised before, at: http:[EMAIL PROTECTED]/msg00819.html > But we have the 7-

Re: No phonetic scripts and still happy?

2002-03-20 Thread Autrijus Tang
On Thu, Mar 21, 2002 at 03:24:34AM +0900, Dan Kogai wrote: > >: Not as far as I know. Hiragana / Katakana is exclusively a Japanese > >: thing ain't it? > >Yes. Bopomofo is the Chinese equivalent. > Not quite. Bopomofo is more like international phonetic symobols; use > in education but not i

Re: Encode touched

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 08:01 , Jarkko Hietaniemi wrote: > Change 15378 by jhi@alpha on 2002/03/20 21:57:22 > > Subject: [PATCH] > From: "Paul Marquess" <[EMAIL PROTECTED]> > Date: Wed, 20 Mar 2002 22:56:17 - > Message-ID: <[EMAIL PROTECTED]> > > pa

Re: ext/Encode/t/CJKalias.t

2002-03-20 Thread Jarkko Hietaniemi
On Thu, Mar 21, 2002 at 08:04:15AM +0900, Dan Kogai wrote: > On Thursday, March 21, 2002, at 07:53 , Jarkko Hietaniemi wrote: > > Change 15377 by jhi@alpha on 2002/03/20 21:50:11 > > > > A plan is better. > > > > Affected files ... > > > > ... //depot/perl/ext/Encode/t/CJKalias.t#3 edit > >

Re: ext/Encode/t/CJKalias.t

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 07:53 , Jarkko Hietaniemi wrote: > Change 15377 by jhi@alpha on 2002/03/20 21:50:11 > > A plan is better. > > Affected files ... > > ... //depot/perl/ext/Encode/t/CJKalias.t#3 edit Thank you. Applied. Come to think of it, there is no reason no to te

Re: No phonetic scripts and still happy?

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 05:54 , Nicholas Clark wrote: > By "Alphabet" you mean the Roman Alphabet? Yes. Roman alphabet that is. But Greek Alphabet is no stranger here in Japan (They all exist in JIS X 0208). Any high school students have to learn to spell theta to mean angle (but

Encode-0.94 uploaded

2002-03-20 Thread Dan Kogai
Encode Hackers, I have just uploaded Encode-0.94 as http://www.dan.co.jp/~dankogai/Encode-0.94.tar.gz as well as CPAN. Here are changes. 0.94 Thu Mar 21 2002 + lib/Encode/Description.pod ! lib/Encode/Encoding.pm Now the pod in Encode.pm is abridged as programming references. lib/

Re: [PATCH][docs] Encode.pm

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 03:49 , Jarkko Hietaniemi wrote: > I don't think perlencode would work, a perl*.pod for a module is not > our habit. Maybe a subpod Encode::Naming, or something like that? Okay, then how about Encode::Description, with pod file stored as lib/Description.pod? Dan

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 03:49 , Nick Ing-Simmons wrote: > Dan Kogai <[EMAIL PROTECTED]> writes: > > All this talk of Encode in CPAN and incrementing versions reminded me > that I messed with some :encoding parts of Encode.xs the other day. > (For PerlIO reasons). > > Is it appropriate to h

Re: [PATCH][docs] Encode.pm

2002-03-20 Thread Jarkko Hietaniemi
>I will apply the patch then revise the pod. Well, Encode.pm's pod > seems too long now. I think the general question should be exported > to, say, perlencode.pod and Encode.pm's pod slim down to API > documentations. What do you say, jhi? I don't think perlencode would work, a perl*.p

Re: [PATCH][docs] Encode.pm

2002-03-20 Thread Dan Kogai
On Wednesday, March 20, 2002, at 11:48 , Anton Tagunov wrote: > With my upgraded knowledge of encoding naming I propose this. > > Justification: > > 1) > Shift-JIS -> Shift_JIS does not hurt anyone, cause it does > not work either way, Encode::encode >

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 03:29 , Jarkko Hietaniemi wrote: > There seems to be still references to ::Extended in the pod... Removed. New diff as follows; > rcsdiff -u lib/Encode/Tcl.pm === RCS file: lib/Encode/Tcl.pm,v re

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Jarkko Hietaniemi
On Thu, Mar 21, 2002 at 03:28:49AM +0900, Dan Kogai wrote: > On Thursday, March 21, 2002, at 02:51 , Jarkko Hietaniemi wrote: > > You really need to have this since Sadahiro's patch removed > > Tcl::Extended... > > > > Change 15336 by jhi@alpha on 2002/03/19 18:41:50 > > > > Begone, Encode::Tc

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 02:51 , Jarkko Hietaniemi wrote: > You really need to have this since Sadahiro's patch removed > Tcl::Extended... > > Change 15336 by jhi@alpha on 2002/03/19 18:41:50 > > Begone, Encode::Tcl::Extended. Thank you. Applied. Proof as follows; > rcsdiff -u

No phonetic scripts and still happy?

2002-03-20 Thread Dan Kogai
On Thursday, March 21, 2002, at 01:14 , Larry Wall wrote: > Jean-Michel Hiver writes: > : > Do Chinese use Katakana? Does such encoding make > : > sense? > : > : Not as far as I know. Hiragana / Katakana is exclusively a Japanese > : thing ain't it? > > Yes. Bopomofo is the Chinese equivalent.

Re: Fwd: Do Chinese use Katakana? What is 1988.enc (was: ISO-8859-1 vs ISO 8859-1 )

2002-03-20 Thread Andreas Marcel Riechert
Jean-Michel Hiver <[EMAIL PROTECTED]> writes: > > Do Chinese use Katakana? Does such encoding make > > sense? > > Not as far as I know. Hiragana / Katakana is exclusively a Japanese > thing ain't it? In Chinese script Hiragana and Katakana are not used. While it is a exclusively Japanese [wri

Re: Fwd: Do Chinese use Katakana? What is 1988.enc (was: ISO-8859-1 vs ISO 8859-1 )

2002-03-20 Thread Larry Wall
Jean-Michel Hiver writes: : > Do Chinese use Katakana? Does such encoding make : > sense? : : Not as far as I know. Hiragana / Katakana is exclusively a Japanese : thing ain't it? Yes. Bopomofo is the Chinese equivalent. Larry

Re: Fwd: Do Chinese use Katakana? What is 1988.enc (was: ISO-8859-1 vs ISO 8859-1 )

2002-03-20 Thread Jean-Michel Hiver
> Do Chinese use Katakana? Does such encoding make > sense? Not as far as I know. Hiragana / Katakana is exclusively a Japanese thing ain't it? Cheers, -- IT'S TIME FOR A DIFFERENT KIND OF WEB Jean-Michel Hiver - Software Direct

Re: [PATCH] open.pm tweaks

2002-03-20 Thread Jarkko Hietaniemi
On Wed, Mar 20, 2002 at 09:12:54PM +0800, Autrijus Tang wrote: > I was wrong when suggesting the zh_TW should mean big5; didn't look at > the code carefully enoough to see there's a glaring > > $locale_encoding eq 'euc' && > > conditional. Sorry. > > Also patched is the problem of incorrect

Fwd: Do Chinese use Katakana? What is 1988.enc (was: ISO-8859-1 vs ISO 8859-1 )

2002-03-20 Thread Anton Tagunov
This is a forwarded message From: Anton Tagunov <[EMAIL PROTECTED]> To: Nick Ing-Simmons <[EMAIL PROTECTED]> Date: Tuesday, March 19, 2002, 7:35:43 PM Subject: Do Chinese use Katakana? What is 1988.enc (was: ISO-8859-1 vs ISO 8859-1 ) ===8<==Original message text=== Hello,

Fwd: [PATCH][docs] Encode.pm

2002-03-20 Thread Anton Tagunov
This is a forwarded message From: Anton Tagunov <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] <[EMAIL PROTECTED]> Date: Tuesday, March 19, 2002, 9:48:06 PM Subject: [PATCH][docs] Encode.pm ===8<==Original message text=== Hello, developers! With my upgraded knowledge of encodi

Fwd: GB 1833 name abiguous

2002-03-20 Thread Anton Tagunov
This is a forwarded message From: Anton Tagunov <[EMAIL PROTECTED]> To: Nick Ing-Simmons <[EMAIL PROTECTED]> Date: Tuesday, March 19, 2002, 7:12:26 PM Subject: ISO-8859-1 vs ISO 8859-1 (typo + UTF8 case too :) ===8<==Original message text=== Hello Nick! Hello, all! Confes

[PATCH] open.pm tweaks

2002-03-20 Thread Autrijus Tang
I was wrong when suggesting the zh_TW should mean big5; didn't look at the code carefully enoough to see there's a glaring $locale_encoding eq 'euc' && conditional. Sorry. Also patched is the problem of incorrectly probing for the 'encoding(xxx)' layer, instead of just 'encoding' itself. Be

Re: [PATCH][docs] Encode.pm

2002-03-20 Thread Nick Ing-Simmons
Anton Tagunov <[EMAIL PROTECTED]> writes: >Hello, Nick! > >With my upgraded knowledge of encoding naming I propose this. > >Justification: > >1) > Shift-JIS -> Shift_JIS does not hurt anyone, cause it does > not work either way, Encode::encode > und

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Autrijus Tang
On Wed, Mar 20, 2002 at 06:34:33PM +0900, Dan Kogai wrote: > I also added zh_TW.Big5 to alias testing. cool. :-) HanExtra 0.03 is on CPAN now. /Autrijus/ msg00781/pgp0.pgp Description: PGP signature

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Dan Kogai
On Wednesday, March 20, 2002, at 06:06 , Autrijus Tang wrote: > Ok, I'll fix it. Attached please find aliases to TW.pm, and a > pedantic correction to Encode.pm usage. Thank you. Patches applied fine and 'make test' is hairless. Will be available in the next release which I am planning to up

Re: Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Autrijus Tang
On Wed, Mar 20, 2002 at 05:38:52PM +0900, Dan Kogai wrote: > + Encoding aliases added. Note TW is left untouched because > euc-tw is not implemented in TW but in Encode::HanExtra. > Autrijus, you may fix Encode::HanExtra. Ok, I'll fix it. Attached please find aliases to TW.pm, and a pe

Encode-0.93 released; Uploaded to CPAN

2002-03-20 Thread Dan Kogai
Encode Hackers, I just uploaded Encode-0.93 to CPAN, also avaiable as http://www.dan.co.jp/~dankogai/Encode-0.93.tar.gz Changes Since 0.92 includes; + Changes + AUTHORS ! Encode.pm ! README + Mention to [EMAIL PROTECTED] added ! JP/JP.pm + Encoding aliases added so you can feed l