[EMAIL PROTECTED] (Dan Kogai) wrote in
news:1E456D1E-E7DE-11D6-BF8B-0003939A104C@;dan.co.jp:
> On Friday, Oct 25, 2002, at 14:10 Asia/Tokyo, Philip Newton wrote:
>> Well, partially because there's no "good" names for many of the
>> characters. What do you call "生"? "CJK UNIFIED IDEOGRAPH-751F"?
>> (That's the current Unicode "name", but it's not particularly useful.)
>> "CJK shou"? "CJK sei"? "CJK sheng1"? "CJK saeng"? "CJK ikiru"? ikasu,
>> ikeru, umareru, umu, ou, haeru, hayasu, ki, nama, naru, nasu, musu,
>> .... which one do you pick?
>
> If we are stuck with de jure, ex officio names from Unicode Consortium
> we are out of luck but this is perl; if there are more than one way to
> do it, Why not more than one way to name it? I am kind of wondering a
> charnames extension that goes like
>
> use charnames ":ja"; # Japanese
> print "\N{sei-ikiru}";
> #
> use charnames ":ko";
> print "\N{saeng}";
> #
> use charanames ":zh";
> print "\N{sheng1}";
All ideal for the new aliassing module!
use charnames ":full", ":alias" => "ja";
which is the same as
use charnames ":alias" => ":ja";
Maybe we should supply such alias files, which have no restriction in the
number of aliases to the same long name.
Asuming sei-ikiru's real name is "CHINESE BLUBBER POND WITH FROGS", there
is no problem with
use charnames ":full", ":alias" => {
"sei-ikiru" => "CHINESE BLUBBER POND WITH FROGS",
"saeng" => "CHINESE BLUBBER POND WITH FROGS",
"sheng1" => "CHINESE BLUBBER POND WITH FROGS",
"frog-pond" => "CHINESE BLUBBER POND WITH FROGS",
};
Forgive my ignorance of korean, japanese, chinese and CJK codings in
general. Just pointing out the new welth of possibilities.
Now we can support
unicore/ko_alias.pl
unicore/ja_alias.pl
unicore/zh_alias.pl
...
> Since pragmatic approach is rather inflexible, I would prefer OO
> aproach, like
>
> use Char::Name;
>
> my $char = Char::Name->new;
>
> print $char->jp("sei-ikiru");
>
> I know Japanese is the biggest nightmare to name characters because in
> Japanese we give too many "names" to each character; It's really hard
> to disambiguate these....
>
> I may come up with something as I look though Unihan DB, now accessible
> via CPAN (Unicode::Unihan)....
>
>> Cheers,
>> Philip Newton (不衣律不入豚)
>
> \x{5c0f}\x{98fc} \x{5f3e}
>
>
--
H.Merijn Brand Amsterdam Perl Mongers (http://www.amsterdam.pm.org/)
using perl-5.6.1, 5.7.2 & 630 on HP-UX 10.20 & 11.00, AIX 4.2, AIX 4.3,
WinNT 4, Win2K pro & WinCE 2.11 often with Tk800.022 &/| DBD-Unify
ftp://ftp.funet.fi/pub/languages/perl/CPAN/authors/id/H/HM/HMBRAND/