On Fri, 6 Jun 2008, Isaac Colley wrote:

Hello,

I am building a dictionary based language detection program using the
dumps of aspell dictionaries.

I need to expand wordlists completely, however some languages, such as
Russian, after expansion will leave behind affixes (I think) after a
'?'.  For example:

aspell dump master ru | aspell -l ru expand

I think it might be an encoding problem. Try setting your locale to C by setting the LANG environmental variable and making sure the locale was changes. For example using bash:

  $ export LANG=C
  $ locale
  LANG=C
  LC_CTYPE="C"
  LC_COLLATE="C"
  LC_TIME="C"
  LC_NUMERIC="C"
  LC_MONETARY="C"
  LC_MESSAGES="C"
  LC_ALL=

You might also need to set LC_ALL.



_______________________________________________
Aspell-user mailing list
Aspell-user@gnu.org
http://lists.gnu.org/mailman/listinfo/aspell-user

Reply via email to