On Fri, 2008-02-08 at 01:49 -0500, Gene Heskett wrote:
> The sa-learn --spam can process a message in 5 to 10 seconds or so, so if 
> I've 
> dropped 20 doofus mails in the spam directory and fire it off, I have it done 
> and kmail is back among the living in 2-3 minutes.

This seems *way* too high. If there have been only 20 messages total in
that folder, sa-learn should have processed these in a few *seconds* or
less.


> But, feeding it a 'ham' directory with about 7k messages in it, turned 
> sa-learn into a 100% cpu hog, [...]

What did you expect? Based on your numbers above, processing that folder
would have taken 10-20 *hours*...


> incrementing the message processed number only 
> about every 3 to 5 minutes. I couldn't kill it, it kept coming back and I 
> must have fed it a kill -9 50 times.

Hmm. Kmail doesn't start one process per mail by any chance?


> So what is the maximum number of files in a directory that one can feed to 
> sa-learn --ham and expect it to achieve normal speed?

Dunno if there are limitations -- however, your 7k messages should be
perfectly fine. Just ran a test on a 6k messages mbox file, and there
was no noticeable difference to a 30 messages test.


> The command that kmail issues to it is:
> sa-learn --ham  /root/Mail/(foldername)/cur

You're not using root as your ordinary user account, do you !?

  guenther


-- 
char *t="[EMAIL PROTECTED]";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}

Reply via email to