[SAtalk] perl programming

2002-12-01 Thread vlad f kropachew
Hello. Can anybody help with following troubles? I wrote simple perl script based on POD documentation for spamassassin: #!/usr/bin/perl use Mail::SpamAssassin; my ($mail, $spamtest, $status, $spamdir); my $f = new Mail::SpamAssassin(); $f-load_scoreonly_sql

Re: [SAtalk] spamarchive.org corpuses have quite low success rates with SA

2002-12-01 Thread Justin Mason
Michael Bell said: Kinda hard to say. Most of it IS spammy and valid MIME as far as I could tell. I did catch a few clearly-non-spam (evite) things in the corpus. The lack of Received lines does mess up quite a few DNS related tests (RBL, MX records) but I wouldn't think that alone made

Re: [SAtalk] Feature request (maybe)

2002-12-01 Thread Daniel Quinlan
Rob MacGregor [EMAIL PROTECTED] writes: I'd really like to see a blacklist_to option somewhere, or some other way of automatically tagging a given destination as indicating 99.99% probable spam. Basically, I've got an email address that I have to have to host some web pages on my ISP. The

Re: [SAtalk] FYI: Tweaks made to a great v2.43

2002-12-01 Thread Daniel Quinlan
Smart,Dan [EMAIL PROTECTED] writes: Classification: PUBLIC If anyone is interested I run a low/medium volume (35,000/day), site-wide SA corporate installation behind Postfix. Here's my local.cf updates that seem to have tuned things well for me. BTW: I kill messages 7.0 and

Re: [SAtalk] spamarchive.org corpuses have quite low success rates with SA

2002-12-01 Thread Michael Bell
--- Justin Mason [EMAIL PROTECTED] wrote: I haven't looked yet, but (a) if they're not well-cleaned (ie if there is valid nonspam in there), it's going to seriously impact the archive's usefulness. It's not well-cleaned. In a random survey of 5 spam files, one was clearly a valid

Re: [SAtalk] spamarchive.org corpuses have quite low success rates with SA

2002-12-01 Thread Matthew Davis
* Michael Bell ([EMAIL PROTECTED]) wrote: Agreed. I think it's worthless too. Just wanted to bring up the topic, so we could all be prepared for newbies asking the question. Now we have a thread to point to Here's an example of their substandard corpus. Note that while looking for an

Re: [SAtalk] spamarchive.org corpuses have quite low success rates with SA

2002-12-01 Thread Michael Bell
A fair statement as to what it is good for,yes. It could be used for bayesian body stuff - dunno how that's stacked up in your tests (which I notice do include most headers) - but it's pretty limited otherwise. Note that the PR for these guys (CipherMail or whatever $25000 box it's called

Re: [SAtalk] spamarchive.org corpuses have quite low success rates with SA

2002-12-01 Thread Daniel Quinlan
Michael Bell [EMAIL PROTECTED] writes: Out of an evil sense of malice g, here's an example of one of their falsely included messages which IMO doesn't belong in the corpus - it is simply NOT spam per se. That message doesn't appear to be spam, but it could be. Spammers often disguise their

Re: [SAtalk] FYI: Tweaks made to a great v2.43

2002-12-01 Thread +archive . spamassassin-talk
--On Tuesday, November 26, 2002 11:26 AM -0600 Smart,Dan [EMAIL PROTECTED] wrote: | header RCVD_IN_WIREHUB rbleval:check_rbl('relay','blackholes.wirehub.net') | describe RCVD_IN_WIREHUBVMC-Received via a relay in WIREHUB | score RCVD_IN_WIREHUB 2.0 Couldn't this be altered to