Re: Spam not getting scanned

2007-02-16 Thread Magnus Holmgren
On Thursday 15 February 2007 15:48, Dave Williss wrote: Is there some Spamassassin rule that may be auto-whitelisting this (because the forged sender is an actual account), or is Postfix confused into thinking that the sender is local and just not running it through SA? Now that I think about

WhitelistRcvdIP plugin 0.2

2007-02-16 Thread Raul Dias
For some reason I had 3 lines deleted in the plugin when I sent it. here is the correct version. -Raul Dias On Thu, 2007-02-15 at 15:26 -0300, Raul Dias wrote: Hi, I, as other people, needed to whitelist addresses that came from servers with no rdns. whitelist_from is not a good option,

worst spammer ever

2007-02-16 Thread Raul Dias
What happens when a clueless newbie tries to pass spam? This happens = http://www.dias.com.br/worst-spam.txt -Raul Dias

Re: worst spammer ever

2007-02-16 Thread Matt Richards
Raul Dias wrote: What happens when a clueless newbie tries to pass spam? This happens = http://www.dias.com.br/worst-spam.txt -Raul Dias '192.168.0.%RND_DIGIT'lol :) i can just see getting an email with an IP of 192.168.0.450 and whats the point in trying to fake it from a 192.168

Re: worst spammer ever

2007-02-16 Thread Justin Mason
Matt Richards writes: Raul Dias wrote: What happens when a clueless newbie tries to pass spam? This happens = http://www.dias.com.br/worst-spam.txt -Raul Dias '192.168.0.%RND_DIGIT'lol :) i can just see getting an email with an IP of 192.168.0.450 and whats the point in

Re: Bayes training question

2007-02-16 Thread Steven Stern
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 yossim wrote: Hi folks, Hi can i learn miss identified junk mail that is store on exchange or at the otulook clients? Can i simply copy those mails to a folder on my Linux server and run sa-learn with the required parameters? Kindly regards, Yossi

Re: Spam not getting scanned

2007-02-16 Thread maillist
Dave Williss wrote: I've started recieving a few spams a day that aren't even getting scanned by Spamassassin. Or at least they don't get any X-Spam headers added on. The messages in question all have forged senders to make them look like they came from an existing user within my own domain

Re: [2] Bayes training question

2007-02-16 Thread yossim
Thanks Steven, I appriciate your quick response :) I will review the link and try to implement it on our site. Regards, Yossi Steven Stern wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 yossim wrote: Hi folks, Hi can i learn miss identified junk mail that is store on

Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last year that we could

Re: Google Summer of Code 2007 ...

2007-02-16 Thread C. Bensend
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? Perhaps this is trivial, or not desired by anyone else but myself, but I'd _love_ to be able to strip SpamAssassin tags via spamc

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Doc Schneider
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-16 Thread DAve
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? I believe this was once mentioned on a Justin's blog (but can't find a ref now), the following sounds promising as an additional

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Mark Martinec writes: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? I believe this was once mentioned on a Justin's blog (but can't find a ref now), the following sounds

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Dan
On Feb 16, 2007, at 7:35, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Justin Mason writes: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. Right. I felt a need for something like this when I was switching

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Chris St. Pierre
On Fri, 16 Feb 2007, Mark Martinec wrote: I believe this was once mentioned on a Justin's blog (but can't find a ref now), the following sounds promising as an additional classifier to existing bayes (especially since the author comes from the same organization as myself :)

Re: Increase in image spam

2007-02-16 Thread LuKreme
On 6-Feb-2007, at 09:30, Sujit Choudhury wrote: Lately there has been an increase in image spam. We are using imageinfo.cf with ImageInfo plugin. However, this is not making a lot of difference. We are also using virtually all the SARE rules plus using sa-update and restarting spamd everyday.

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Duncan Findlay
On Fri, Feb 16, 2007 at 09:31:13AM -0800, Dan wrote: On Feb 16, 2007, at 7:35, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? Also, any suggestions from outside the dev team? Anyone got good ideas

Re: Should I use greylisting

2007-02-16 Thread LuKreme
On 29-Jan-2007, at 12:24, [EMAIL PROTECTED] wrote: While I generally believe that end users should send thru a smarthost, I also think it is a bad idea to restrict them to the network provider's smarthost. They might prefer to send via their company's SMTP instead That's what port 587

Re: HarrisPoll

2007-02-16 Thread Theo Van Dinter
On Fri, Feb 16, 2007 at 11:26:22AM -0700, LuKreme wrote: Where does the WS-SURBL info come from? I ask because the Harris Poll email is getting tagged with it. As far as I know, I've never received spam from them, so I'd like to check out the actual rbl. You'd want to talk to the

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Bart Schaefer
On 2/16/07, Justin Mason [EMAIL PROTECTED] wrote: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3785

Export and append Bayes DB

2007-02-16 Thread Sam Przyswa
Hi, Is it possible to export a Bayes DB from a server and then append (not restore) it to others servers ? Sam. -- Ce message a été vérifié par MailScanner pour des virus ou des polluriels et rien de suspect n'a été trouvé.

Re: Google Summer of Code 2007 ...

2007-02-16 Thread John D. Hardin
On Fri, 16 Feb 2007, Justin Mason wrote: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. +1 -- John Hardin KA7OHZ

Re: [2] Bayes training question

2007-02-16 Thread yossim
Steve, I am not sure that i have anderstand the first script starting with get_ham_spam were you use fetchmail (where the data is kept?) and the last one get-ham-spam when you used wget command to get all the ham.spam emails. Kindly regards, Yossi yossim wrote: Thanks Steven, I

Re: Google Summer of Code 2007 ...

2007-02-16 Thread John Rudd
John D. Hardin wrote: On Fri, 16 Feb 2007, Justin Mason wrote: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. +1 If that's a

Re: HarrisPoll

2007-02-16 Thread Otto TheBusDriver
On 2/16/07, LuKreme [EMAIL PROTECTED] wrote: Where does the WS-SURBL info come from? I ask because the Harris Poll email is getting tagged with it. As far as I know, I've never received spam from them, so I'd like to check out the actual rbl. 3.0 URIBL_WS_SURBL Contains a URL listed

RE: Google Summer of Code 2007 ...

2007-02-16 Thread Matthew Wilson
- Full, tested, supportable multithreaded support - Full, tested, supportable support for an asynchronous I/O model (a la qpsmtpd-async) - Pluggable to the point where all configuration and settings can be pulled from anywhere (databases, files, in-memory cache) at runtime, so SA could stay

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Raul Dias
On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects.

How do disable user specific configuration?

2007-02-16 Thread Robert Nicholson
Can anybody tell me how you are suppose to completely disable the user configuration? I do not want ~/.spamassassin and if I must create this directory I want it relative to prefix as I maintain two different releases of SA installed into a shell account's home directory. Since I'm the

Bayes db size....

2007-02-16 Thread Robert Nicholson
Why then is my Bayes DB 20MEG in size right now if =item bayes_expiry_max_db_size (default: 15) What should be the maximum size of the Bayes tokens database? When expiry occurs, the Bayes system will keep either 75% of the maximum value, or 100,000 tokens, whichever has a

Re: Bayes db size....

2007-02-16 Thread Theo Van Dinter
On Fri, Feb 16, 2007 at 05:42:13PM -0600, Robert Nicholson wrote: Why then is my Bayes DB 20MEG in size right now if =item bayes_expiry_max_db_size (default: 15) That's in number of tokens, not physical size in bytes. 100,000 tokens, whichever has a larger value. 150,000 tokens

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? Here's another one, to seize the opportunity when internal changes are being contemplated: Split the process into two parts: -

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Matthew Wilson wrote: - Full, tested, supportable multithreaded support - Full, tested, supportable support for an asynchronous I/O model (a la qpsmtpd-async) I think effort could be better spent elsewhere. Spam checking lands itself ideally to running parallel individual processes, with

Re: [2] Bayes training question

2007-02-16 Thread Steven Stern
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 yossim wrote: Steve, I am not sure that i have anderstand the first script starting with get_ham_spam were you use fetchmail (where the data is kept?) and the last one get-ham-spam when you used wget command to get all the ham.spam emails. Kindly

Re: Bayes db size....

2007-02-16 Thread Robert Nicholson
So you're saying that right now seen isn't capped like tokens right? On Feb 16, 2007, at 5:45 PM, Theo Van Dinter wrote: On Fri, Feb 16, 2007 at 05:42:13PM -0600, Robert Nicholson wrote: Why then is my Bayes DB 20MEG in size right now if =item bayes_expiry_max_db_size (default:

Re: Bayes db size....

2007-02-16 Thread Theo Van Dinter
On Fri, Feb 16, 2007 at 06:17:36PM -0600, Robert Nicholson wrote: So you're saying that right now seen isn't capped like tokens right? seen has no max size nor expiry features. -- Randomly Selected Tagline: Like any French restaurant in America, it was overpriced, noisy, moody, and would put

Re: Export and append Bayes DB

2007-02-16 Thread Michael Parker
Sam Przyswa wrote: Hi, Is it possible to export a Bayes DB from a server and then append (not restore) it to others servers ? No, you generally can't combine two bayes databases that way. Best bet is to pick the most complete one and use it. For more details see a really long post on the

Re: Bayes db size....

2007-02-16 Thread Robert Nicholson
Well then I only care about tokens and not repeated emails can I disable seen? On Feb 16, 2007, at 6:19 PM, Theo Van Dinter wrote: On Fri, Feb 16, 2007 at 06:17:36PM -0600, Robert Nicholson wrote: So you're saying that right now seen isn't capped like tokens right? seen has no max size

RE: Google Summer of Code 2007 ...

2007-02-16 Thread Matthew Wilson
-Original Message- From: Mark Martinec [mailto:[EMAIL PROTECTED] Sent: Friday, February 16, 2007 6:09 PM To: users@spamassassin.apache.org Subject: Re: Google Summer of Code 2007 ... Matthew Wilson wrote: - Full, tested, supportable multithreaded support - Full, tested,

Re: Bayes db size....

2007-02-16 Thread Theo Van Dinter
On Fri, Feb 16, 2007 at 06:45:51PM -0600, Robert Nicholson wrote: Well then I only care about tokens and not repeated emails can I disable seen? You can't disable it, but you can delete it, as previously stated. -- Randomly Selected Tagline: 54% of all statistics are made up. No, make that

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O, and it's probably no big deal to provide about 2 GB of memory

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Mark Martinec writes: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O, and it's probably no big deal to

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Quinn Comendant
On Fri, 16 Feb 2007 15:35:39 +, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? How about an extensive statistics reporting tool, possible web-based, that can show how well a current spamassassin

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Raul Dias
On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O,

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Quinn Comendant
On Fri, 16 Feb 2007 18:01:37 -0800, Quinn Comendant wrote: And/or a fix for the qmail+simscan per-user preferences spamc -u issue where if an email is addressed to multiple users or an alias spamc isn't passed the correct user. Sorry to reply to myself, but I want to retract that last