Hello,

I am building a dictionary-based language detection program using the
dumps of aspell dictionaries.

I need to expand the word lists completely. However, for some languages,
such as Russian, the expanded output still contains what I think are
affixes following a '?'.  For example:

aspell dump master ru | aspell -l ru expand

will produce lines like:
умаслит? умаслит?ла умаслит?ли умаслит?ло

'умаслит' appears to be the stem, but what about the characters after
the '?'?  Are they affixes?  If so, how do I fully expand them?  Any
insight on how to correctly expand word lists for every language would be
greatly appreciated.
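
To make the problem easier to reproduce, here is a minimal Python sketch
that separates the forms still containing a '?' from the rest of the
expanded output.  It assumes each output line holds whitespace-separated
word forms, which is only my reading of the sample line above; the
function name split_expanded is made up for illustration and is not part
of aspell.

```python
import sys


def split_expanded(lines):
    """Split expanded lines into word forms.

    Assumption: forms on a line are separated by whitespace.
    Returns (clean, suspect), where suspect holds every form that
    still contains a '?' character.
    """
    clean, suspect = set(), set()
    for line in lines:
        for form in line.split():
            if "?" in form:
                suspect.add(form)
            else:
                clean.add(form)
    return clean, suspect


if __name__ == "__main__":
    # Reads the expanded word list from standard input.
    clean, suspect = split_expanded(sys.stdin)
    for form in sorted(suspect):
        print(form)
    print(f"{len(clean)} clean forms, {len(suspect)} with '?'",
          file=sys.stderr)
```

Piping the output of the expand command above into this script would
list only the questionable forms, which should show whether the '?'
appears in every dictionary or only in some.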

Thanks,
Isaac Colley


_______________________________________________
Aspell-user mailing list
Aspell-user@gnu.org
http://lists.gnu.org/mailman/listinfo/aspell-user
