General question about SA default ruleset

2007-07-11 Thread Fabien GARZIANO
Hi Folks,

I can't get nowhere, Wiki, FAQ, mail archive, this thing. I'm looking
for a kind of dictionnary providing an explanation for each rule of
the default ruleset provided with spamassassin ... Sorry for my poor
english. Let's have an exemple : 

RCVD_ILLEGAL_IP : In my .cf files, I only got Received: contains
illegal IP address. I would like to know what exactly means an illegal
IP (looks like it can sometimes be an IP block undefined or some other
weird thing). 

In fact, if you know if this kind of index/dictionnary like this exists,
and if so, where it can be found, I would really appreciate. If this
doesn't exist, I would like to try to make one myself. If someone got
any information, that would be nice. I precide I'm not looking for
something like 
http://systems.cs.uoregon.edu/Solaris/spamassassin.php
Or 
http://www.nesox.com/document/Spamassassin%20Explanation.asp

That basically just rewrite what can be found on .cf files ... 

Thanks.


RE: Spamassassin detailed log entries

2006-10-19 Thread Fabien GARZIANO

 De : Bowie Bailey Envoyé : mercredi 18 octobre 2006 18:17
 What I do is this:
 
   add_header all Report _REPORT_
 
 This gives me the detailed X-Spam-Report header listing the 
 scores, rule names, and rule descriptions.

Thanks for the answer. I've tried most add_header options (like all or 
_TESTSSCORES(,)_, the mail headers are the way I want them. But I can't get 
the same modifications in the maillog file. This is for my qmail/spamassassin 
V3.1.6 setup. I also got a MailScanner/postfix/spamassassin V3.0.5 and the log 
file (/var/log/maillog) contains TESTS triggered + SCORES. 

 De : Theo Van Dinter 18 octobre 2006 16:44 
 I cant find how to ask spamassassin (spamd in my case) to write 
 detailed score in my log files.
 There's currently no way to modify the log output from a config 
 file.  You'd have to modify the spamd code to change its 
 log output.

So if I understand, in my MailScanner setup (where the log is perfectly what I 
need), it's not spamassassin that write the logs this way but MailScanner ? 


Spamassassin detailed log entries

2006-10-18 Thread Fabien GARZIANO
Hi All,


I cant find how to ask spamassassin (spamd in my case) to write detailed
score in my log files. I'm pretty sure it's in my local.cf file (cause
it's system wide), but I can't find which command to insert. I've seen
on the spamassassin wiki the _TESTSSCORES(,)_ tag, but I dont find in
which option it fits.

--

F a b i e n   G a r z i a n o


RE: Need help with several things in SA

2006-10-10 Thread Fabien GARZIANO
 
Thanks Matt for this long explanation. I agree with the fact that you
should avoid raising rules score or think twice before doing it. A lot
of trouble may appear with a rule with a to high score. I got in trouble
at the beginning with that. I raised some scores very high (more than
20), and I got AWL (see
http://wiki.apache.org/spamassassin/AutoWhitelist) running. When I
realized I made a mistake with that scores, I lowered them down. But AWL
kept on score high, logically ... 

Now I think the only score rules I change are RBL, URIBL etc. And I
check my bayes scoring regulary ... 

By the way, anyone knows where I can find an explanation for each rules
of the default sa ruleset ? I know, most of the time, the title or desc
are explicit, but sometimes not. I've search (maybe not enough) the Wiki
but didnt found ... 

thanks


 -Message d'origine-
 2) finding and testing some of the add-on rulesets to expand 
 the diversity of rules in your SA set.  Generally speaking, 
 you'll get fewer FPs from 2 rules that score 2.5 each on a 
 particular spam than you will from 1 rule scoring 5.0.
 
 
 


Low spam score but marqued as spam

2006-10-06 Thread Fabien GARZIANO
Hi folks,
 
I'm in trouble with 1 particular mail and in front of a strange problem.
I got a mail which my spamassassin score 1.5 (the score limit before
spam is identified is 5.2) but it tag it like spam anyway
 
X-caliseo-MailScanner: Found to be clean
X-caliseo-MailScanner-SpamCheck: spam, SBL+XBL,
 SpamAssassin (score=1.562, requis 5.8, BAYES_00 -2.60,
 HTML_90_100 0.02, HTML_FONT_BIG 0.14, HTML_MESSAGE 0.00,
 HTML_NONELEMENT_00_10 0.00, HTML_SHOUTING3 0.02, RCVD_IN_XBL 3.98)
X-caliseo-MailScanner-SpamScore: 1
X-caliseo-MailScanner-From: [EMAIL PROTECTED]
X-Spam-Status: Yes

SpamAssassin version 3.0.5
  running on Perl version 5.8.6

I can't find anything in the previous mails in here. And google give
nothing. 
Anyone got an idea ?

--

F a b i e n   G a r z i a n o


RE: Low spam score but marqued as spam

2006-10-06 Thread Fabien GARZIANO

I agree with you. I'll turn off this right now. I didn't know this and I 
realized I bother spamassassin mailing list for nothing, Sorry guys :-( 
Thanks for the answers Anthony and François.

Note : it's stranger that when I use online web XBL checks result is negative 
(no match). And now mails received from my gateway, from the same user, the 
same smtp server, are not XBL positive. Strange huh ?

 -Message d'origine-
 De : Anthony Peacock [mailto:[EMAIL PROTECTED] 
 Envoyé : vendredi 6 octobre 2006 15:37
 À : users@spamassassin.apache.org
 Objet : Re: Low spam score but marqued as spam
 
 Hi,
 
 François Rousseau wrote:
  I don't know MailScanner but maybe because they have found 
 the adress 
  in a XBL list?
  
  X-caliseo-MailScanner-SpamCheck: spam, SBL+XBL,
 
 
 Yes, MailScanner can be configured to do the RBL lookups 
 itself and to mark messages on that basis (this is regardless 
 of the SA score).
 
 This can be configured on the MailScanner.conf file:
 
 # If a message appears in at least this number of Spam 
 Lists (as defined # above), then the message will be treated 
 as spam and so the Spam # Actions will happen, unless the 
 message reaches the levels for High # Scoring Spam. By 
 default this is set to 1 to mimic the previous # behaviour, 
 which means that appearing in any Spam Lists will cause # 
 the message to be treated as spam.
 # This can also be the filename of a ruleset.
 Spam Lists To Be Spam = 1
 
 
 Personally I turn this off and let SA do its scoring.  The 
 MailScanner settings are a little too cut and dried for my 
 taste.  If I was to trust a RBL that much I would use it at 
 SMTP time anyway.
 
 
  
  
  
  2006/10/6, Fabien GARZIANO [EMAIL PROTECTED]:
 
  Hi folks,
 
  I'm in trouble with 1 particular mail and in front of a 
 strange problem.
  I got a mail which my spamassassin score 1.5 (the score 
 limit before 
  spam is identified is 5.2) but it tag it like spam anyway
 
  X-caliseo-MailScanner: Found to be clean
  X-caliseo-MailScanner-SpamCheck: spam, SBL+XBL, SpamAssassin 
  (score=1.562, requis 5.8, BAYES_00 -2.60, HTML_90_100 0.02, 
  HTML_FONT_BIG 0.14, HTML_MESSAGE 0.00, HTML_NONELEMENT_00_10 0.00, 
  HTML_SHOUTING3 0.02, RCVD_IN_XBL 3.98)
  X-caliseo-MailScanner-SpamScore: 1
  X-caliseo-MailScanner-From: [EMAIL PROTECTED]
  X-Spam-Status: Yes
 
  SpamAssassin version 3.0.5
running on Perl version 5.8.6
 
  I can't find anything in the previous mails in here. And 
 google give 
  nothing.
  Anyone got an idea ?
 
  --
 
  F a b i e n   G a r z i a n o
 
  
 
 
 --
 Anthony Peacock
 CHIME, Royal Free  University College Medical School
 WWW:http://www.chime.ucl.ac.uk/~rmhiajp/
 If you have an apple and I have  an apple and we  exchange 
 apples then you and I will still each have  one apple. But  
 if you have an idea and I have an idea and we exchange these 
 ideas, then each of us will have two ideas. -- George Bernard Shaw
 


RE: Problem with URIBL rules : false positive and not listed while mannually checking

2006-10-04 Thread Fabien GARZIANO

 What version of SpamAssassin are you running?  Versions before
 3.1 have an infrequent DNS query bug:
 
   http://bugzilla.spamassassin.org/show_bug.cgi?id=3997
 

I'm running SpamAssassin version 3.0.5. (On Perl 5.8.6).
I've checked the the bugzilla page about this bug. I dont understand a
damn thing 8-|... I guess that i need to update my spamassassin setup
and I'm scared. I'm gonna check the wiki for advice on spamassassin
updates, but first, get a horse shoe, and recite a hundred mantras !

 Another possibility is that there is a DNS proxy or DNS 
 modification service like OpenDNS changing the DNS results in 
 a way that's not compatible with SURBL applications:
 
   http://www.surbl.org/faq.html#opendns

I dont run any dns service on this box ... It's a clean MailScanner VM
and I dont see no process named 'dns' with ps ax

 In any case, none of the domains mentioned are blacklisted, 
 so there is a problem with your SpamAssassin or DNS.

About the checks, did you use
http://www.rulesemporium.com/cgi-bin/uribl.cgi ?
Do you know a way to see result for each test (PH, OB, etc ... ) ?

Thank you for this anwser Jeff


RE: Problem with URIBL rules : false positive and not listed while mannually checking

2006-10-04 Thread Fabien GARZIANO
 I did a local DNS query:
 
   dig somedomain.com.multi.surbl.org a
 
 If you get NXDOMAIN then it's not listed.
 
  Do you know a way to see result for each test (PH, OB, etc ... ) ?
 
   dig somedomain.com.multi.surbl.org txt
 
 will show the lists; so will the lookup page, and so will:
 
   spamassassin -D  some_message_in_a_file

Thanks a lot for the tip with dig. That's what I was looking for. 

 There's usually some DNS service on the box or on your local 
 or ISP network.  If you're on a Unix/Linux/BSD box it's 
 usually called 'named'.  As long as DNS isn't doing anything 
 unusual, then it's a non-issue.  Just use normal, default DNS 
 service if your message volume is less than 100k to 250k per day.

And for dns, I'm sorry, I typed it too fast and when I meant no 'dns' i
also meant no 'named' process. 
On this box, i've tried
 :# dig nortel.com.multi.surbl.org a
And it returned me NXDOMAIN as you said, so I guess it may not be a dns
problem on this box.
(the DNS serveur answering is my ISP's).
I think i'm gonna update Spamassassin anyway, it should be a good reason
to do it.

Thanks for all this goods anwsers ! 
P.S : sorry Jeff if you receive this Email twice


RE: Stock spam in images

2006-10-02 Thread Fabien GARZIANO
 
Have been answered few threads ago and more... May be you didn't scan enough ^^

You can use FuzzyOCR module (But dont ask me how to use, I've never tried ^^)

-Message d'origine-
De : Dylan Bouterse [mailto:[EMAIL PROTECTED] 
Envoyé : lundi 2 octobre 2006 15:38
À : users@spamassassin.apache.org
Objet : Stock spam in images

I'm a newbie to the list and have been scanning recent posts to see if what I'm 
about to ask about has been covered but I haven't seen anything yet.

Lately I have been getting more and more of the stock alert spam but now all 
the good info is in an image and typically following the image is random text 
to fool the Bayesian filter. I think the random text thing has been covered 
here recently. It's frustrating when sa is giving a
-1.6 (or so) score to these emails right off the bat. Quite a few of these 
aren't even getting spam headers because they aren't scoring high enough. Is 
there some magical trick to help score these messages higher?
Maybe a future version of sa will incorporate an OCR module? :)

Dylan


RE: Stock spam in images

2006-10-02 Thread Fabien GARZIANO
 
Too bad, cause I agree with Giampaolo, it would be great. What about making a 
plugin including OCR components but instead of using inner dictionnary, passing 
it back to spamassassin through the MTA... Yeah, I know, the load will increase 
... But that would be nice ?

...

... Ok,I go back to sleep 

-Message d'origine-
De : Randal, Phil [mailto:[EMAIL PROTECTED] 
Envoyé : lundi 2 octobre 2006 16:19
À : users@spamassassin.apache.org
Objet : RE: Stock spam in images

Giampaolo Tomassoni wrote:

 And, by the way, it seems to work!
 
 Actually, the only limit I see is the own-made FuzzyOcr.words (and, 
 maybe, the fact that script text may probably get undetected). 
 Wouldn't it be better to inject the detected text back to SA? There 
 should be enough variants of spam worlds to let SA fuzzily catch the 
 ones from images.
 
 Am I wrong?

I think so.  Some of the words would be perfectly legitimate in the text of 
emails but rarely found in attached legitimate images.

Quite apart from the fact that Spamassassin isn't designed for reinjection.

Cheers,

Phil
--
Phil Randal
Network Engineer
Herefordshire Council
Hereford, UK


Re: bayes sync is hogging cpu

2006-09-26 Thread Fabien GARZIANO
 
Ok, I may say something dumb, but have you tried to clear the bayes db
with : 
sa-learn --clear --dbpath

-- Fab