[Encode] 1.77 Released

2002-10-05 Thread Dan Kogai

Porters,

   I am releasing Encode 1.77 to accommodate the up-and-coming changes 
to bleedperl that makes tr/// free of eval qq{} under "use encoding" 
pragma.  This problem was addressed by me as;

> From: Dan Kogai <[EMAIL PROTECTED]>
> Date: Thu Oct 3, 2002  20:31:13 Asia/Tokyo
> To: [EMAIL PROTECTED]
> Cc: [EMAIL PROTECTED]
> Subject: tr/// and use encoding
> Message-Id: <[EMAIL PROTECTED]>


Whole Package -- to be uploaded to CPAN soon:
http://www.dan.co.jp/~dankogai/Encode-1.77.tar.gz

Patch against bleedperl (242 lines) for Hugo:
http://www.dan.co.jp/~dankogai/current-1.77.diff.gz

And here is the Changes.  This one also includes minor alias fix by 
Autrijus.

$Revision: 1.77 $ $Date: 2002/10/06 03:27:02 $
! t/jperl.t
   * Modified to accomodate up and comming patch by Inaba-san that
 will fix tr/// needing eval qq{}
 Message-Id: <[EMAIL PROTECTED]>
! encoding.pm
   * pod fixes/enhancements to reflect the changes above
! lib/Encode/Alias.pm
   "Encode::TW is correct, Encode::Alias not." - /Autrijus/
   Message-Id: <[EMAIL PROTECTED]>

Note this update alone will NOT fix the tr/// vs. use encoding problem 
noted above,  the real fix needs to be applied to bleedperl (regcomp.c 
and such).  The patch was already made available by Inaba Hiroto 
<[EMAIL PROTECTED]> (IsP for short)and my preliminary tests already 
show that it works with a minor fix to ext/Encode/t/jperl.t.  Hence the 
update to Encode prior to bleedperl patch.

Here is the proposed schedule before IsP goes into bleedperl;

On Sunday, Oct 6, 2002, at 12:10 Asia/Tokyo, Jarkko Hietaniemi wrote:
>> Okay, how about the schedule below?
>>
>> 0)   I release IsP-safe Encode 1.77 to CPAN.  All I have to do is to
>> comment out the local(${^ENCODE}) black magic so it is still test-safe
>> under perl 5.8.0
>>
>> 1)  Hugo to sync bleedperl w/ Encode 1.77
>>
>> 2)  New *.t for IsP under somewhere OTHER THAN ext/Encode/t.
>> lib/nihongo.t, maybe?  that is to test thoroughly what IsP brings.
>> That test suite does not have to run on stock perl 5.8.0
>>
>> 3)  With enough assurance w/ the test suite above, IsP goes into
>> bleedperl.
>
> Sounds good to me.

The Encode update complies step 0.  Though this update is primarily for 
bleedperl, it is perl 5.8.0-safe.

Please allow some time before step 2-3.  I would like Inaba-san's help 
on 2 and I am rather busy recently

Dan the Encode Maintainer




Re[6]: Encode::compat 0.01 says "Unsupported conversion"

2002-10-05 Thread Robert Allerstorfer

Hi,

On Fri, 4 Oct 2002, 22:48 GMT+08 (16:48 local time) Autrijus Tang
wrote:

> Would it be too much to ask for a list of such libiconv-incompatibility
> map, or at least a pointer to the supported canonical names of those
> recoders?

currently, the Encode::Alias module used by your Encode::compat does
not always resolve to an encoding name known by GNU libiconv. The
current version of libiconv, 1.8, knows the following names and
aliases:

ANSI_X3.4-1968 ANSI_X3.4-1986 ASCII CP367 IBM367 ISO-IR-6 ISO646-US ISO_646.IRV:1991 
US US-ASCII CSASCII
UTF-8
ISO-10646-UCS-2 UCS-2 CSUNICODE
UCS-2BE UNICODE-1-1 UNICODEBIG CSUNICODE11
UCS-2LE UNICODELITTLE
ISO-10646-UCS-4 UCS-4 CSUCS4
UCS-4BE
UCS-4LE
UTF-16
UTF-16BE
UTF-16LE
UTF-32
UTF-32BE
UTF-32LE
UNICODE-1-1-UTF-7 UTF-7 CSUNICODE11UTF7
UCS-2-INTERNAL
UCS-2-SWAPPED
UCS-4-INTERNAL
UCS-4-SWAPPED
C99
JAVA
CP819 IBM819 ISO-8859-1 ISO-IR-100 ISO_8859-1 ISO_8859-1:1987 L1 LATIN1 CSISOLATIN1
ISO-8859-2 ISO-IR-101 ISO_8859-2 ISO_8859-2:1987 L2 LATIN2 CSISOLATIN2
ISO-8859-3 ISO-IR-109 ISO_8859-3 ISO_8859-3:1988 L3 LATIN3 CSISOLATIN3
ISO-8859-4 ISO-IR-110 ISO_8859-4 ISO_8859-4:1988 L4 LATIN4 CSISOLATIN4
CYRILLIC ISO-8859-5 ISO-IR-144 ISO_8859-5 ISO_8859-5:1988 CSISOLATINCYRILLIC
ARABIC ASMO-708 ECMA-114 ISO-8859-6 ISO-IR-127 ISO_8859-6 ISO_8859-6:1987 
CSISOLATINARABIC
ECMA-118 ELOT_928 GREEK GREEK8 ISO-8859-7 ISO-IR-126 ISO_8859-7 ISO_8859-7:1987 
CSISOLATINGREEK
HEBREW ISO-8859-8 ISO-IR-138 ISO_8859-8 ISO_8859-8:1988 CSISOLATINHEBREW
ISO-8859-9 ISO-IR-148 ISO_8859-9 ISO_8859-9:1989 L5 LATIN5 CSISOLATIN5
ISO-8859-10 ISO-IR-157 ISO_8859-10 ISO_8859-10:1992 L6 LATIN6 CSISOLATIN6
ISO-8859-13 ISO-IR-179 ISO_8859-13 L7 LATIN7
ISO-8859-14 ISO-CELTIC ISO-IR-199 ISO_8859-14 ISO_8859-14:1998 L8 LATIN8
ISO-8859-15 ISO-IR-203 ISO_8859-15 ISO_8859-15:1998
ISO-8859-16 ISO-IR-226 ISO_8859-16 ISO_8859-16:2000
KOI8-R CSKOI8R
KOI8-U
KOI8-RU
CP1250 MS-EE WINDOWS-1250
CP1251 MS-CYRL WINDOWS-1251
CP1252 MS-ANSI WINDOWS-1252
CP1253 MS-GREEK WINDOWS-1253
CP1254 MS-TURK WINDOWS-1254
CP1255 MS-HEBR WINDOWS-1255
CP1256 MS-ARAB WINDOWS-1256
CP1257 WINBALTRIM WINDOWS-1257
CP1258 WINDOWS-1258
850 CP850 IBM850 CSPC850MULTILINGUAL
862 CP862 IBM862 CSPC862LATINHEBREW
866 CP866 IBM866 CSIBM866
MAC MACINTOSH MACROMAN CSMACINTOSH
MACCENTRALEUROPE
MACICELAND
MACCROATIAN
MACROMANIA
MACCYRILLIC
MACUKRAINE
MACGREEK
MACTURKISH
MACHEBREW
MACARABIC
MACTHAI
HP-ROMAN8 R8 ROMAN8 CSHPROMAN8
NEXTSTEP
ARMSCII-8
GEORGIAN-ACADEMY
GEORGIAN-PS
KOI8-T
MULELAO-1
CP1133 IBM-CP1133
ISO-IR-166 TIS-620 TIS620 TIS620-0 TIS620.2529-1 TIS620.2533-0 TIS620.2533-1
CP874 WINDOWS-874
VISCII VISCII1.1-1 CSVISCII
TCVN TCVN-5712 TCVN5712-1 TCVN5712-1:1993
ISO-IR-14 ISO646-JP JIS_C6220-1969-RO JP CSISO14JISC6220RO
JISX0201-1976 JIS_X0201 X0201 CSHALFWIDTHKATAKANA
ISO-IR-87 JIS0208 JIS_C6226-1983 JIS_X0208 JIS_X0208-1983 JIS_X0208-1990 X0208 
CSISO87JISX0208
ISO-IR-159 JIS_X0212 JIS_X0212-1990 JIS_X0212.1990-0 X0212 CSISO159JISX02121990
CN GB_1988-80 ISO-IR-57 ISO646-CN CSISO57GB1988
CHINESE GB_2312-80 ISO-IR-58 CSISO58GB231280
CN-GB-ISOIR165 ISO-IR-165
ISO-IR-149 KOREAN KSC_5601 KS_C_5601-1987 KS_C_5601-1989 CSKSC56011987
EUC-JP EUCJP EXTENDED_UNIX_CODE_PACKED_FORMAT_FOR_JAPANESE CSEUCPKDFMTJAPANESE
MS_KANJI SHIFT-JIS SHIFT_JIS SJIS CSSHIFTJIS
CP932
ISO-2022-JP CSISO2022JP
ISO-2022-JP-1
ISO-2022-JP-2 CSISO2022JP2
CN-GB EUC-CN EUCCN GB2312 CSGB2312
CP936 GBK
GB18030
ISO-2022-CN CSISO2022CN
ISO-2022-CN-EXT
HZ HZ-GB-2312
EUC-TW EUCTW CSEUCTW
BIG-5 BIG-FIVE BIG5 BIGFIVE CN-BIG5 CSBIG5
CP950
BIG5-HKSCS BIG5HKSCS
EUC-KR EUCKR CSEUCKR
CP949 UHC
CP1361 JOHAB
ISO-2022-KR CSISO2022KR
437 CP437 IBM437 CSPC8CODEPAGE437
CP737
CP775 IBM775 CSPC775BALTIC
852 CP852 IBM852 CSPCP852
CP853
855 CP855 IBM855 CSIBM855
857 CP857 IBM857 CSIBM857
CP858
860 CP860 IBM860 CSIBM860
861 CP-IS CP861 IBM861 CSIBM861
863 CP863 IBM863 CSIBM863
CP864 IBM864 CSIBM864
865 CP865 IBM865 CSIBM865
869 CP-GR CP869 IBM869 CSIBM869
CP1125


The list above is the result of 'iconv -l'. I browsed a bit through
the source files and found that all these names are defined in the
files lib/encodings*.def. All the platform independent encodings are
defined in 'encodings.def' which contains the following:


/* Copyright (C) 1999-2002 Free Software Foundation, Inc.
   This file is part of the GNU LIBICONV Library.

   The GNU LIBICONV Library is free software; you can redistribute it
   and/or modify it under the terms of the GNU Library General Public
   License as published by the Free Software Foundation; either version 2
   of the License, or (at your option) any later version.

   The GNU LIBICONV Library is distributed in the hope that it will be
   useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
   Library General Public License for more details.

   You should have received a copy of the GNU Library General Public
   License along with the GNU LIBICONV Library; see the file COPYING.LIB.
   If