[Encode] 1.77 Released
Porters, I am releasing Encode 1.77 to accommodate the up-and-coming changes to bleedperl that makes tr/// free of eval qq{} under "use encoding" pragma. This problem was addressed by me as; > From: Dan Kogai <[EMAIL PROTECTED]> > Date: Thu Oct 3, 2002 20:31:13 Asia/Tokyo > To: [EMAIL PROTECTED] > Cc: [EMAIL PROTECTED] > Subject: tr/// and use encoding > Message-Id: <[EMAIL PROTECTED]> Whole Package -- to be uploaded to CPAN soon: http://www.dan.co.jp/~dankogai/Encode-1.77.tar.gz Patch against bleedperl (242 lines) for Hugo: http://www.dan.co.jp/~dankogai/current-1.77.diff.gz And here is the Changes. This one also includes minor alias fix by Autrijus. $Revision: 1.77 $ $Date: 2002/10/06 03:27:02 $ ! t/jperl.t * Modified to accomodate up and comming patch by Inaba-san that will fix tr/// needing eval qq{} Message-Id: <[EMAIL PROTECTED]> ! encoding.pm * pod fixes/enhancements to reflect the changes above ! lib/Encode/Alias.pm "Encode::TW is correct, Encode::Alias not." - /Autrijus/ Message-Id: <[EMAIL PROTECTED]> Note this update alone will NOT fix the tr/// vs. use encoding problem noted above, the real fix needs to be applied to bleedperl (regcomp.c and such). The patch was already made available by Inaba Hiroto <[EMAIL PROTECTED]> (IsP for short)and my preliminary tests already show that it works with a minor fix to ext/Encode/t/jperl.t. Hence the update to Encode prior to bleedperl patch. Here is the proposed schedule before IsP goes into bleedperl; On Sunday, Oct 6, 2002, at 12:10 Asia/Tokyo, Jarkko Hietaniemi wrote: >> Okay, how about the schedule below? >> >> 0) I release IsP-safe Encode 1.77 to CPAN. All I have to do is to >> comment out the local(${^ENCODE}) black magic so it is still test-safe >> under perl 5.8.0 >> >> 1) Hugo to sync bleedperl w/ Encode 1.77 >> >> 2) New *.t for IsP under somewhere OTHER THAN ext/Encode/t. >> lib/nihongo.t, maybe? that is to test thoroughly what IsP brings. >> That test suite does not have to run on stock perl 5.8.0 >> >> 3) With enough assurance w/ the test suite above, IsP goes into >> bleedperl. > > Sounds good to me. The Encode update complies step 0. Though this update is primarily for bleedperl, it is perl 5.8.0-safe. Please allow some time before step 2-3. I would like Inaba-san's help on 2 and I am rather busy recently Dan the Encode Maintainer
Re[6]: Encode::compat 0.01 says "Unsupported conversion"
Hi, On Fri, 4 Oct 2002, 22:48 GMT+08 (16:48 local time) Autrijus Tang wrote: > Would it be too much to ask for a list of such libiconv-incompatibility > map, or at least a pointer to the supported canonical names of those > recoders? currently, the Encode::Alias module used by your Encode::compat does not always resolve to an encoding name known by GNU libiconv. The current version of libiconv, 1.8, knows the following names and aliases: ANSI_X3.4-1968 ANSI_X3.4-1986 ASCII CP367 IBM367 ISO-IR-6 ISO646-US ISO_646.IRV:1991 US US-ASCII CSASCII UTF-8 ISO-10646-UCS-2 UCS-2 CSUNICODE UCS-2BE UNICODE-1-1 UNICODEBIG CSUNICODE11 UCS-2LE UNICODELITTLE ISO-10646-UCS-4 UCS-4 CSUCS4 UCS-4BE UCS-4LE UTF-16 UTF-16BE UTF-16LE UTF-32 UTF-32BE UTF-32LE UNICODE-1-1-UTF-7 UTF-7 CSUNICODE11UTF7 UCS-2-INTERNAL UCS-2-SWAPPED UCS-4-INTERNAL UCS-4-SWAPPED C99 JAVA CP819 IBM819 ISO-8859-1 ISO-IR-100 ISO_8859-1 ISO_8859-1:1987 L1 LATIN1 CSISOLATIN1 ISO-8859-2 ISO-IR-101 ISO_8859-2 ISO_8859-2:1987 L2 LATIN2 CSISOLATIN2 ISO-8859-3 ISO-IR-109 ISO_8859-3 ISO_8859-3:1988 L3 LATIN3 CSISOLATIN3 ISO-8859-4 ISO-IR-110 ISO_8859-4 ISO_8859-4:1988 L4 LATIN4 CSISOLATIN4 CYRILLIC ISO-8859-5 ISO-IR-144 ISO_8859-5 ISO_8859-5:1988 CSISOLATINCYRILLIC ARABIC ASMO-708 ECMA-114 ISO-8859-6 ISO-IR-127 ISO_8859-6 ISO_8859-6:1987 CSISOLATINARABIC ECMA-118 ELOT_928 GREEK GREEK8 ISO-8859-7 ISO-IR-126 ISO_8859-7 ISO_8859-7:1987 CSISOLATINGREEK HEBREW ISO-8859-8 ISO-IR-138 ISO_8859-8 ISO_8859-8:1988 CSISOLATINHEBREW ISO-8859-9 ISO-IR-148 ISO_8859-9 ISO_8859-9:1989 L5 LATIN5 CSISOLATIN5 ISO-8859-10 ISO-IR-157 ISO_8859-10 ISO_8859-10:1992 L6 LATIN6 CSISOLATIN6 ISO-8859-13 ISO-IR-179 ISO_8859-13 L7 LATIN7 ISO-8859-14 ISO-CELTIC ISO-IR-199 ISO_8859-14 ISO_8859-14:1998 L8 LATIN8 ISO-8859-15 ISO-IR-203 ISO_8859-15 ISO_8859-15:1998 ISO-8859-16 ISO-IR-226 ISO_8859-16 ISO_8859-16:2000 KOI8-R CSKOI8R KOI8-U KOI8-RU CP1250 MS-EE WINDOWS-1250 CP1251 MS-CYRL WINDOWS-1251 CP1252 MS-ANSI WINDOWS-1252 CP1253 MS-GREEK WINDOWS-1253 CP1254 MS-TURK WINDOWS-1254 CP1255 MS-HEBR WINDOWS-1255 CP1256 MS-ARAB WINDOWS-1256 CP1257 WINBALTRIM WINDOWS-1257 CP1258 WINDOWS-1258 850 CP850 IBM850 CSPC850MULTILINGUAL 862 CP862 IBM862 CSPC862LATINHEBREW 866 CP866 IBM866 CSIBM866 MAC MACINTOSH MACROMAN CSMACINTOSH MACCENTRALEUROPE MACICELAND MACCROATIAN MACROMANIA MACCYRILLIC MACUKRAINE MACGREEK MACTURKISH MACHEBREW MACARABIC MACTHAI HP-ROMAN8 R8 ROMAN8 CSHPROMAN8 NEXTSTEP ARMSCII-8 GEORGIAN-ACADEMY GEORGIAN-PS KOI8-T MULELAO-1 CP1133 IBM-CP1133 ISO-IR-166 TIS-620 TIS620 TIS620-0 TIS620.2529-1 TIS620.2533-0 TIS620.2533-1 CP874 WINDOWS-874 VISCII VISCII1.1-1 CSVISCII TCVN TCVN-5712 TCVN5712-1 TCVN5712-1:1993 ISO-IR-14 ISO646-JP JIS_C6220-1969-RO JP CSISO14JISC6220RO JISX0201-1976 JIS_X0201 X0201 CSHALFWIDTHKATAKANA ISO-IR-87 JIS0208 JIS_C6226-1983 JIS_X0208 JIS_X0208-1983 JIS_X0208-1990 X0208 CSISO87JISX0208 ISO-IR-159 JIS_X0212 JIS_X0212-1990 JIS_X0212.1990-0 X0212 CSISO159JISX02121990 CN GB_1988-80 ISO-IR-57 ISO646-CN CSISO57GB1988 CHINESE GB_2312-80 ISO-IR-58 CSISO58GB231280 CN-GB-ISOIR165 ISO-IR-165 ISO-IR-149 KOREAN KSC_5601 KS_C_5601-1987 KS_C_5601-1989 CSKSC56011987 EUC-JP EUCJP EXTENDED_UNIX_CODE_PACKED_FORMAT_FOR_JAPANESE CSEUCPKDFMTJAPANESE MS_KANJI SHIFT-JIS SHIFT_JIS SJIS CSSHIFTJIS CP932 ISO-2022-JP CSISO2022JP ISO-2022-JP-1 ISO-2022-JP-2 CSISO2022JP2 CN-GB EUC-CN EUCCN GB2312 CSGB2312 CP936 GBK GB18030 ISO-2022-CN CSISO2022CN ISO-2022-CN-EXT HZ HZ-GB-2312 EUC-TW EUCTW CSEUCTW BIG-5 BIG-FIVE BIG5 BIGFIVE CN-BIG5 CSBIG5 CP950 BIG5-HKSCS BIG5HKSCS EUC-KR EUCKR CSEUCKR CP949 UHC CP1361 JOHAB ISO-2022-KR CSISO2022KR 437 CP437 IBM437 CSPC8CODEPAGE437 CP737 CP775 IBM775 CSPC775BALTIC 852 CP852 IBM852 CSPCP852 CP853 855 CP855 IBM855 CSIBM855 857 CP857 IBM857 CSIBM857 CP858 860 CP860 IBM860 CSIBM860 861 CP-IS CP861 IBM861 CSIBM861 863 CP863 IBM863 CSIBM863 CP864 IBM864 CSIBM864 865 CP865 IBM865 CSIBM865 869 CP-GR CP869 IBM869 CSIBM869 CP1125 The list above is the result of 'iconv -l'. I browsed a bit through the source files and found that all these names are defined in the files lib/encodings*.def. All the platform independent encodings are defined in 'encodings.def' which contains the following: /* Copyright (C) 1999-2002 Free Software Foundation, Inc. This file is part of the GNU LIBICONV Library. The GNU LIBICONV Library is free software; you can redistribute it and/or modify it under the terms of the GNU Library General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. The GNU LIBICONV Library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Library General Public License for more details. You should have received a copy of the GNU Library General Public License along with the GNU LIBICONV Library; see the file COPYING.LIB. If