To start, there are several very real things wrong with your example
message. In my opinion, that message was correctly classified.
Do you have any better-representative samples that you can paste in
full? (http://pastebin.com/)
Have you tried using "-D bayes" to see what tokens are being learned
incorrectly? Your score for BAYES_50 seems high for a message that gets
a neutral result from Bayes.
On 1/23/2015 8:55 AM, Wolf Drechsel wrote:
Hi everybody,
I googled and read a lot - but couldnt find any trick...
After months of training still round 90% of all messages are treated
as SPAM, allthough I'm marking all of them as HAM.
My environment:
Ubuntu 14.04
kmail 4.14.2 in the kontact (kdepim) suite
SpamAssassin version 3.4.0
running on Perl version 5.18.2
I tried this installation/config procedure:
http://www.spamtips.org/p/install-procedure.html
but nothing changed.
Here is one example:
2.6 FORGED_YAHOO_RCVD Gefälschte "Received"-Kopfzeile von yahoo.com
gefunden
0.2 FREEMAIL_ENVFROM_END_DIGIT Envelope-from freemail username ends in
digit (<sender_address>[at]yahoo.com)
0.2 FREEMAIL_REPLYTO_END_DIGIT Reply-To freemail username ends in digit
(<sender_address>)
0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider
(<sender_address>)
2.0 BAYES_50 BODY: Spamwahrscheinlichkeit nach Bayes-Test: 40-60%
[score: 0.4760]
0.0 HTML_MESSAGE BODY: Nachricht enthält HTML
0.0 T_DKIM_INVALID DKIM-Signature header exists but is not valid
1.2 RDNS_NONE Delivered to internal network by a host with no rDNS
0.0 T_REMOTE_IMAGE Message contains an external image
But not all of the messages do have that detailed report, some are
just put into the SPAM folder.
Any hints will be appreciated!
Wolf