The following module was proposed for inclusion in the Module List:
modid: Lingua::RU::Translit
DSLIP: bdcfp
description: Converts texts from "translit" to koi8-r
userid: SERG (Sergei Golubchik)
chapterid: 11 (String_Lang_Text_Proc)
communities:
similar:
Lingua::RU::Charset - converts between various Cyrillic charsets
rationale:
well, I failed to find a module that converts from Russian
"translit" (that is Russian text approximated by latin1 charset)
back to Cyrillic. Lingua::RU::Charset and Convert::Cyrillic (also
more generic Convert::Translit and
iconv-based-don't-remember-the-name) convert between Cyrillic
charsets, but they only do one-to-one translation (simple
tr/.../.../). Lingua::RU::Charset can convert to "translit" - it's
one-to-many lossy convertion (tr/.../.../ and few s/./.../g). As
cyrillic-translit conversion is lossy, translit-cyrillic conversion
is ambiguous, so this module uses Hidden Markov Model to choose a
most probable cyrillic original. Here's "uniqueness of approach" -
this is the only charset conversion module (though, russian
"translit" is not technically a charset), that is not a set of tr///
and s/// commands (and cannot be rewritten as such).
enteredby: SERG (Sergei Golubchik)
enteredon: Mon Oct 21 19:15:26 2002 GMT
The resulting entry would be:
Lingua::RU::
::Translit bdcfp Converts texts from "translit" to koi8-r SERG
Thanks for registering,
The Pause Team
PS: The following links are only valid for module list maintainers:
Registration form with editing capabilities:
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=93300000_9653ec323c84378c&SUBMIT_pause99_add_mod_preview=1
Immediate (one click) registration:
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=93300000_9653ec323c84378c&SUBMIT_pause99_add_mod_insertit=1