Jim Maul wrote:
> Ingo Reinhart wrote:
> 
>> Hello!
>>
>> If I commit a big mail (32 MB) to sa-learn it need long time. I must
>> wait 50 sec. and the sa-learn process need 332 MB RAM.
>>
>> What can I do for faster proceed?
>>
>> Ingo
>>
>>
>>
>>
> 
> um..since messages over 250k (default) wont be scanned by SA, why bother
> sa-learning anything over this limit?  Sa isnt going to scan it anyway.
> 
> -Jim
> 




Based on the way bayes works, that doesn't make much sense Jim.

Bayes doesn't learn messages, it learns tokens from within messages.

Really, you don't care if SA is going to scan messages of the same size or not.
You care if it will scan messages with some of the same content.

It's quite possible the 32mb is a large version of a message that's normally
short. For example logwatch output.

The only reason training the 32mb message would be pointless would be if it only
contained content that would be in similarly large messages.



Minor Note of Clarification: that 250k default limit applies to those who use
spamd, which admittedly Ingo does use. But it is not inherent in spamassassin in
general (i.e. those using the API or spamassassin command-line don't have this
feature unless implemented elsewhere)




Reply via email to