Re: Upgrade my rules to Steroid Strength?

2004-12-12 Thread jdow
From: "Peter Matulis" <[EMAIL PROTECTED]>
> --- Michele <[EMAIL PROTECTED]> wrote:
> >
> > Are you using DCC and Bayes?
>
> I, for one, would like to use Bayes.  The docs say it is enabled by
default.  How does one
> begin testing or using this?

You must train first. The Bayes filter is only as good as its training.

Segregate out 200 or more known good messages and 200 or more known bad
messages. (More is better.) Feed the ham messages to "sa-learn --ham"
and the spam messages to "sa-learn --spam". Then, basically, it should
start working. It'll work better the more you feed it. And do NOT mix
up spam and ham in your training. Furthermore if your "mail boxes" are
in single files for multiple messages, mbox format" you must add the
"--mbox" option ot both "sa-learn" invocations above.

{^_^}




Undo what autolearn has done

2004-12-12 Thread Jeff Ramsey
When I first setup version 3.0.X, I turned on auto-learn. It let way too
many messages through. I have turned it off, and I am now using a
slightly modified version of the 'learnspam' script from the wiki.

My problem is that during the auto-learn days some messages got auto
whitelisted, and now I cannot seem to get rid of them. Is there a way
that I can 'unlearn' these messages, or just start SA's learn process
over?

Thanks,

-- 
Jeff Ramsey
MIS Administrator
Tubafor Mill, Inc.



Re: Upgrade my rules to Steroid Strength?

2004-12-12 Thread Peter Matulis
 --- Michele <[EMAIL PROTECTED]> wrote: 
> 
> Are you using DCC and Bayes?

I, for one, would like to use Bayes.  The docs say it is enabled by default.  
How does one
begin testing or using this?

__ 
Post your free ad now! http://personals.yahoo.ca


Re: Upgrade my rules to Steroid Strength?

2004-12-12 Thread Michele
Are you using DCC and Bayes?
--
Email scanned by Blacknight for viruses and dangerous content.
Visit http://www.blacknight.ie for more information


Upgrade my rules to Steroid Strength?

2004-12-12 Thread Rob Blomquist
I finally have figured out how to set up fetchmail and maildrop to use SA to 
filter my messages. It only took a bunch of figuring out how email works, 
bouncing several hundred messages, losing contact with several listservers 
(probably due to the bounces) and a whole bunch of frustration with the lack 
of email.  As a desktop hobbyist running Linux, it was a great learning 
opportunity.

But now, I need some better rules. I run RDJ with the MRDJ wrapper, and I have 
been running these based on the time they took to process mail. But that is 
not my concern anymore. I really want to run with near 100% clean. Right now 
I am more like 70% clean.

I rather like the MRDJ way of updating, and I am not up for the custom method 
of writing my own rules, as I want to keep Linux a hobby until I am paid for 
it.

Here are the trusted rulesets I am using:
SARE_OEM 
SARE_GENLSUBJ 
SARE_GENLSUBJ_ENG 
SARE_CODING SARE_HEADER1 
SARE_HEADER2 
SARE_BML 
SARE_FRAUD 
SARE_SPOOF 
SARE_UNSUB 
SARE_RANDOM SARE_TOP_200 
BOGUSVIRUS 
ANTIDRUG 
TRIPWIRE 
EVILNUMBERS 
SARE_SPECIFIC

The spam I am missing can't be attached, as mail.apache.org is seeing this 
message as spam if I add them. But ingeneral, it is some of the male pills, 
some that have bizarre phrases and an HTML image, and some of the standard 
Rolex spams. I hope this message makes it.

Rob


Re: Should tagged spam be fed back to server?

2004-12-12 Thread Clarke Brunt
> When training spamassassin, is it a good idea to feed spam already
> marked as spam back to SpamAssassin?  Will this help it or hinder it or
> do neither?

As ever with SpamAssassin, there are many options, and really only you can
decide how best to use them, after studying the documentation.

If you mean training with the 'sa-learn' command, then it depends on whether
the message has been automatically leaned already. Check for autolearn= in
the message headers (if it isn't there, then configure things so that it
is).
If the value is already 'spam', then it's been learned already (and will be
ignored if learned again). It the value is 'no' (or worse 'ham') then you
could usefully sa-learn it.

On the other hand, there's reporting with 'spamassassin --report...' (which
learns as for sa-learn, but also reports the spam to wherever. e.g. Razor,
Spamcop, etc.). My thoughts are that if the message was so spammy as to get
a high score, then there's probably little point (I reject these at SMTP
time, so I don't get chance to report them). But if it's only slightly
spammy, or isn't detected at all, then reporting it seems a good idea.

And if you're worried about the message already containing SpamAssassin
markup, then it's supposed to remove it automatically.

Clarke Brunt




Should tagged spam be fed back to server?

2004-12-12 Thread Steve Dondley
When training spamassassin, is it a good idea to feed spam already 
marked as spam back to SpamAssassin?  Will this help it or hinder it or 
do neither?


Re: Could not create INET Socket

2004-12-12 Thread Matt Kettler
At 02:26 PM 12/11/2004 -0800, Rob Blomquist wrote:
I am trying to get fetchmail, maildrop and spamassassin happily running
together, and most things are working except between maildrop and SA. I am
getting this message:
fetchmail: reading message [EMAIL PROTECTED]:1 of 1 (1122 octets)
#Could not create INET socket on 127.0.0.1:783: Permission denied
(IO::Socket::INET: Permission denied)
maildrop: error writing to filter.
the call to SA is xfilter "spamd"
You really should not be calling spamd in your mail chain, as it does not 
accept email as input. You should be calling spamc.

spamd should already have been started and daemonized by root before hand.


Re: 3.0.1 header rewrite

2004-12-12 Thread Matt Kettler
At 07:40 PM 12/11/2004 -0800, jdow wrote:
Is it possible to have a version the _SCORE_ parameter that prints out
numbers in the old "09.66" style with the leading zero? This is a handy
feature in a mailer that can sort by email by subject. If the spam is
all marked "*SPAM* " followed by the four (or 5) digits of
spam value the sorting is automatic. It makes it easier to scan spams
with values below about 10 to 15 for the rare ham that makes it through.
The 3, 4, or more digits approach does not sort by value correctly.
In SA 3.0.x you can.. From the man Mail::SpamAssassin::Conf manpage:
 _SCORE(PAD)_  message score, if PAD is included and is either spaces or
   zeroes, then pad scores with that many spaces or zeroes
   (default, none)  ie: _SCORE(0)_ makes 2.4 become 02.4,
   _SCORE(00)_ is 002.4.  12.3 would be 12.3 and 012.3
   respectively.





Re: recognizing spam sent by viruses

2004-12-12 Thread hamann . w

>> Also, do these viruses ever use the SMTP server set up on that computer in
>> Outlook (via Windows registry), or do they always use external third-party
>> smtp servers for sending the spam?
>> 
>> (I'm researching this as a prerequisite to trying some additional anti-spam
>> strategies.)
 
>> always <<
I have seen some rubbish in german, a while ago, and seemingly built for the 
german infrastructure:
A high percentage of private internet access goes through a provider that does 
not authenticate
smtp, since they already have authenticated the connection.
The rubbish took advantage of that and was sending via the official mail server

A lot of other spam I receive is sent straight from the client's ip address to 
the target MX,
where outlook would - hopefully - send to the smarthost server first

Wolfgang





3.0.1 header rewrite

2004-12-12 Thread jdow
Is it possible to have a version the _SCORE_ parameter that prints out
numbers in the old "09.66" style with the leading zero? This is a handy
feature in a mailer that can sort by email by subject. If the spam is
all marked "*SPAM* " followed by the four (or 5) digits of
spam value the sorting is automatic. It makes it easier to scan spams
with values below about 10 to 15 for the rare ham that makes it through.
The 3, 4, or more digits approach does not sort by value correctly.

{^_^}



recognizing spam sent by viruses

2004-12-12 Thread Rob McEwen






Is there some way of knowing for **sure** whether a spam which does NOT actually contain a virus was sent by a virus? Is there a corpus of such examples anywhere? Also, do these viruses ever use the SMTP server set up on that computer in Outlook (via Windows registry), or do they always use external third-party smtp servers for sending the spam? (I’m researching this as a prerequisite to trying some additional anti-spam strategies.) Thanks, Rob McEwen






Re[2]: Rude spammers

2004-12-12 Thread Robert Menschel
Hello Jeff,

Saturday, December 11, 2004, 12:11:18 AM, you wrote:

JC> On Friday, December 10, 2004, 11:59:35 PM, Robert Menschel wrote:

JC>>> But "Get a capable html e-mailer" could also be generic
JC>>> text for non-MIME or non-HTML capable mail clients to see. ...

>> I know that my company would never think of saying anything like that
>> to any of our customers nor our vendors.

JC> I've seen similar portions of messages with less rude wording
JC> but similar meaning, e.g., "this message can only be properly
JC> viewed with an HTML-capable program" or something similar.

Yes, the measure of rudeness is a major decision factor.

As much as I would rather receive all email in plain text format, with
a link to a web page if graphics or other special effects were
desired, we must be able to receive emails that have embedded HTML,
and even newsletters that come in ONLY HTML, with statements like you
give.

But IMO any HTML-Only email which actively insults the recipient, or
has the audacity to command the recipient to use an HTML-email client,
is doomed here to the bit bucket.

I process dozens of newsletters here, most of which include HTML
formatting.  All but one or two provide reasonably text along with
their HTML. All that use significant HTML suggest (repeat: suggest,
not demand) that their newsletter can be better read using their web
site, with link provided, or with an HTML email client.

The only emails I've ever seen come through that are insulting or
demanding in this way have been spam, scam, and phish.

So again, yes, we cannot block emails on any and all types of messages
which imply that the message is in HTML. However, we can /judiciously/
include rules which test for the more obnoxious such statements, and
score them accordingly.

I expect that in my three domains
> Get a capable html e-mailer
on a line by itself, will score somewhere between 50% and 90% of my
spam threshold.

Bob Menschel



Re: some mails are not tagged

2004-12-12 Thread jdow
From: "jdow" <[EMAIL PROTECTED]>

> From: "Loren Wilton" <[EMAIL PROTECTED]>
>
> > > since the upgrade from spamassassin 2.61 to 3.01. i sometimes
experience
> > > a strange problem. the subject of some mails is not rewritten with
> > > *SPAM even if the score is high enough and the report attached
> > > to the headers says its spam:
> > >
> > > X-Spam-Checker-Version: SpamAssassin on ianus
> > > X-Spam-Status: Yes, hits=8.0 required=5.0
> tests=BAYES_99,MISSING_HEADERS,
> > > MISSING_SUBJECT,NO_REAL_NAME,RCVD_IN_BL_SPAMCOP_NET,RCVD_IN_XBL,
> > > SPF_SOFTFAIL autolearn=no
> >
> > This is a FAQ question, or at least should be.
> >
> > MISSING_SUBJECT - the original mail doesn't have a subject.
> >
> > SA *RE*writes headers.  It doesn't *create* headers that weren't there.
> > So since there was no subject, there was no subject to rewrite, so there
> was
> > no place to put the tag.
> >
> > There is an enhancement open on this, and I had thought that it actually
> > made it in to 3.1.0 or so; but I could well be mistaken about that.
> >
> > Loren
>
> In the mean time a procmail/formail rule should be able to toss in a
> dummy subject. I'll see if I can think of what the procmail search
> string should look like. (I suspect a subject of all blanks, say 20 to
> 40 of them, would still trigger that behavior.)
>
> {^_^}

Actually it is already a ratware rule. (I just cleaned up 99_OBFU_drugs.cf
and ratware.cf. No more lint errors on 3.0.1! If the appropriate SARE
person emails me I'll send 'em in return email.)

{^_^}