check_whitelist does not compact

2006-11-15 Thread Gaal Yahas
Hello,

Although "check_whitelist --clean" removes entries from the auto whitelist
database, it does not compact the file, at least on my system (latest
SpamAssassin on debian unstable, perl 5.8.8, DB_File 1.814).

A minor modification of the check script that ties a new Berkley db and
copies the cleaned structure over to it brought my 20MB whitelist to
600KB (after --min 3). I'd just send a patch, but I noticed there are
also '.pag' and '.dir' files which I didn't know the purpose of. Are
they safe to leave as is with the compacted db? Should they also be
treated somehow?

-- 
Gaal Yahas <[EMAIL PROTECTED]>
http://gaal.livejournal.com/


Spellcheck plugin?

2004-10-06 Thread Gaal Yahas
I'm wondering, would it be useful to have a plugin that penalizes messages
with many spelling mistakes? This might help against all those creative
ways of spelling out what the spammer wants to sell.

To have any hope of working for people who expect mail in more than one
language, a language recognition tool would be needed, but fortunately
there are some of those around.[1]

This adds quite a bit of complexity. Has anyone given it any thought?


[1] E.g. <http://neugierig.org/software/langid/>.

-- 
Gaal Yahas <[EMAIL PROTECTED]>
http://gaal.livejournal.com/


Concurrent spamc via procmail

2004-09-29 Thread Gaal Yahas
I'm using SpamAssassin 3.0.0 on Debian unstable.

The docs recommend invoking spamassassin via procmail like this:

 :0fw: spamassassin.lock
 * < 256000
 | spamassassin

And to simply replace 'spamassassin' with 'spamc' when using the spamd
setup. I just noticed that spamd preforks, so my question is whether it
still makes sense to put in the lock, and if not, what limits the number
of spamc instances that start up when I receive many mails in a short
time.



And a quickie (sorry for not asking this in a separate thread): is there
a one-liner I can use to show what the Bayes score for a particular
token is? I'm just curious :)

Thanks a lot,
Gaal

-- 
Gaal Yahas <[EMAIL PROTECTED]>
http://gaal.livejournal.com/