Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
Jenda Krynicky schreef: @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; -- Affijn, Ruud Gewoon is een tijger. sub uniq { my $prev; map $_

Re: Advice on how to approach character translation

2008-04-26 Thread J. Peng
On Sat, Apr 26, 2008 at 5:41 PM, Dr.Ruud [EMAIL PROTECTED] wrote: Jenda Krynicky schreef: @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; or:

Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
J. Peng schreef: Dr.Ruud: Jenda Krynicky: @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; or: @signs = map { quotemeta } @signs; That is not the same,

Re: Advice on how to approach character translation

2008-04-26 Thread J. Peng
On Sat, Apr 26, 2008 at 7:54 PM, Dr.Ruud [EMAIL PROTECTED] wrote: J. Peng schreef: Dr.Ruud: Jenda Krynicky: @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map

Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
J. Peng schreef: Dr.Ruud: J. Peng: Dr.Ruud: Jenda Krynicky: @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; or: @signs = map { quotemeta } @signs; That

Re: Advice on how to approach character translation

2008-04-25 Thread R (Chandra) Chandrasekhar
Jenda Krynicky wrote: ... and build a regexp to match the 1-3 characters to replace: @signs = sort {length($b) = length($a)} keys %trans; Thanks for this priceless construct. It was very helpful indeed. @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; # needed a

Re: Advice on how to approach character translation

2008-04-24 Thread R (Chandra) Chandrasekhar
Chas. Owens wrote: The easiest way I can think of is to build a (UTF-8) file named itrans2unicode.table that looks like this a = a aa = ā ~N = ṅ I have successfully created the file lookup.table containing lines as suggested above with ASCII and Unicode characters separated by ' = '.

Re: Advice on how to approach character translation

2008-04-24 Thread Chas. Owens
On Thu, Apr 24, 2008 at 11:40 AM, R (Chandra) Chandrasekhar [EMAIL PROTECTED] wrote: Chas. Owens wrote: The easiest way I can think of is to build a (UTF-8) file named itrans2unicode.table that looks like this a = a aa = ā ~N = ṅ I have successfully created the file

Advice on how to approach character translation

2008-04-23 Thread R (Chandra) Chandrasekhar
Dear Folks, A scheme called ITRANS uses the ASCII printing character set and between one and three printing characters to unambiguously represent characters in Indic scripts or a Romanized script called IAST. Since characters in these scripts have Unicode code points, it should be possible

Re: Advice on how to approach character translation

2008-04-23 Thread Chas. Owens
On Wed, Apr 23, 2008 at 5:34 AM, R (Chandra) Chandrasekhar [EMAIL PROTECTED] wrote: Dear Folks, A scheme called ITRANS uses the ASCII printing character set and between one and three printing characters to unambiguously represent characters in Indic scripts or a Romanized script called

Re: Advice on how to approach character translation

2008-04-23 Thread Jenda Krynicky
From: R (Chandra) Chandrasekhar [EMAIL PROTECTED] 3. Some transliteration examples are shown below: a a U+0061 LATIN SMALL LETTER A aa a U+0101 LATIN SMALL LETTER A WITH MACRON A a U+0101 LATIN SMALL LETTER A WITH MACRON .a ' U+0027 APOSTROPHE ~N