the cleaned structure over to it brought my 20MB whitelist to
600KB (after --min 3). I'd just send a patch, but I noticed there are
also '.pag' and '.dir' files which I didn't know the purpose of. Are
they safe to leave as is with the compacted db? Should they also be
treated somehow?
--
Gaal Yahas
recognition tool would be needed, but fortunately
there are some of those around.[1]
This adds quite a bit of complexity. Has anyone given it any thought?
[1] E.g. http://neugierig.org/software/langid/.
--
Gaal Yahas [EMAIL PROTECTED]
http://gaal.livejournal.com/
score for a particular
token is? I'm just curious :)
Thanks a lot,
Gaal
--
Gaal Yahas [EMAIL PROTECTED]
http://gaal.livejournal.com/