David Golden wrote:
José Castro wrote:
- you give it a set of texts (usually two, but can be more)
- without knowing the language of each text, the module tells you how
likely it is that those texts are translations of each other
It would probably be something in the "Text" of "Lingua" namespace,
and it
would probably include something like "Translations" or "Parallel".
I tend to think of language things in Lingua and text processing in
Text, so I'd lean towards Lingua. After that, my first thought was the
unwieldy:
Lingua::Translation::Identify
or
Lingua::Translation::Compare
I'm not sure if it makes sense to shorten it to the existing
Lingua::Translate namespace for consistency sake:
Lingua::Translate::Identify
Lingua::Translate::Compare
As I look at this, I lean slightly towards "compare" since it definitely
compares, but doesn't necessarily "identify".
Often, 'similar' is used in this context, algorithms that compare or
rate things for equivelance.
Lingua::Text::Similar
Lingua::Translation::Similar
Regards,
Randy.