Google Summer of Code 2007 - Students Wanted

2007-03-16 Thread Michael Parker
Howdy, The time of year for Google Summer of Code has already arrived and once again the Apache Software Foundation is taking part. We are currently looking for students who wish to work on SpamAssassin related projects over the summer. You have until *March 24th* to sign up and submit an

Re: [2] Google Summer of Code 2007 ...

2007-02-25 Thread Andrej Bratko
implementation.

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Per Jessen
C. Bensend wrote: Perhaps this is trivial, or not desired by anyone else but myself, but I'd _love_ to be able to strip SpamAssassin tags via spamc and spamd, instead of having to fire up the full-blown spamassassin for each message. :) formail ? /Per Jessen, Zürich

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
DAve writes: Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Matthew Wilson writes: - Full, tested, supportable multithreaded support In my experience, Perl threading is just not available in a reliable, fast implementation -- this is not viable I'm afraid :( - Full, tested, supportable support for an asynchronous I/O model (a la qpsmtpd-async) A pretty

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Mark Martinec writes: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? Here's another one, to seize the opportunity when internal changes are being contemplated: Split the process

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Mark Martinec writes: On Saturday February 17 2007 03:01, Quinn Comendant wrote: How about an extensive statistics reporting tool, ..., that can show how well a current spamassassin installation is performing and where it needs improvements. Well, not exactly by your words, but in the same

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Doc Schneider writes: Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Raul Dias writes: On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up

Re: Google Summer of Code 2007 ...

2007-02-21 Thread C. Bensend
Perhaps this is trivial, or not desired by anyone else but myself, but I'd _love_ to be able to strip SpamAssassin tags via spamc and spamd, instead of having to fire up the full-blown spamassassin for each message. :) formail ? That would work in most cases, yes. Unfortunately, not in

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Julian Field writes: Justin Mason wrote: DAve writes: Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Raul Dias
On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
actually I think this is already implemented in 3.2.0 -- see http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4636 for details. --j. Raul Dias writes: On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Julian Field
Justin Mason wrote: DAve writes: Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible

Re: Google Summer of Code 2007 ...

2007-02-21 Thread DAve
Justin Mason wrote: DAve writes: Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Raul Dias
) normalize_charset option is set to 1. -Raul Dias --j. Raul Dias writes: On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last year that we could use again

RE: Google Summer of Code 2007 ...

2007-02-21 Thread R Lists06
May I ask... Why is this thread named as such? Does Google help fund SA efforts in one or multiple ways? If so, may I ask how, or for directions to already posted docs on it? - rh -- Robert - Abba Communications Computer Internet Services (509) 624-7159 - www.abbacomm.net

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Daryl C. W. O'Shea
R Lists06 wrote: May I ask... Why is this thread named as such? Does Google help fund SA efforts in one or multiple ways? If so, may I ask how, or for directions to already posted docs on it? If you, uh, Google for Google Summer of Code I'm sure you'll find all you want to know. Daryl

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Raul Dias
On Wed, 2007-02-21 at 17:27 +0100, Justin Mason wrote: Raul Dias writes: On Wed, 2007-02-21 at 15:29 +0100, Justin Mason wrote: actually I think this is already implemented in 3.2.0 -- see http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4636 for details. Nice. This patch

RE: Google Summer of Code 2007 ...

2007-02-21 Thread David B Funk
On Wed, 21 Feb 2007, R Lists06 wrote: May I ask... Why is this thread named as such? Does Google help fund SA efforts in one or multiple ways? If so, may I ask how, or for directions to already posted docs on it? - rh -- Robert - Abba Communications Yes, if you Google for Google Summer

Re: Google Summer of Code 2007 ...

2007-02-21 Thread Justin Mason
Raul Dias writes: On Wed, 2007-02-21 at 17:27 +0100, Justin Mason wrote: Raul Dias writes: On Wed, 2007-02-21 at 15:29 +0100, Justin Mason wrote: actually I think this is already implemented in 3.2.0 -- see http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4636 for details.

RE: Google Summer of Code 2007 ...

2007-02-21 Thread R Lists06
Yes, if you Google for Google Summer of Code+spamassassin you'll get a bunch of relevant hits. ;) For example, check out: http://wiki.apache.org/spamassassin/SummerOfCode2006 Thank you. I was hoping for meaningful and relevant info from someone of authority and in the know from the SA

Re: Google Summer of Code 2007 ...

2007-02-18 Thread Matthias Leisi
Justin Mason wrote: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? If I look at the tools and scripts I built around SA (and which

Re: Google Summer of Code 2007 ...

2007-02-18 Thread Justin Mason
Graham Murray writes: Theo Van Dinter [EMAIL PROTECTED] writes: Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis so if you are unfortunate enough to

Re: Google Summer of Code 2007 ...

2007-02-18 Thread Tim B.
Justin Mason wrote: Graham Murray writes: Theo Van Dinter [EMAIL PROTECTED] writes: Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Justin Mason
Raul Dias writes: On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Raul Dias
On Sat, 2007-02-17 at 11:21 +, Justin Mason wrote: Raul Dias writes: On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Matthew Wilson
Raul Dias writes: **snip If I remember correctly spamd was using something between 2 and 5% of memory reported by top (45 processes max). If it was really shared, it would not have collapsed. My bet is that the model used on Linux is copy on write. So after a fork, when the child spamd

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Chris St. Pierre
On Fri, 16 Feb 2007, Quinn Comendant wrote: How about an extensive statistics reporting tool, possibly web-based, that can show how well a current spamassassin installation is performing and where it needs improvements. It could provide trends in different classes of spam and how each is

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Tim B.
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Theo Van Dinter
On Sat, Feb 17, 2007 at 06:56:28PM -0500, Tim B. wrote: How about a "How many times have I seen this message body" plugin... So each time SA sees the same or similar enough message body, it increases the score. Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. --

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Mark Martinec
On Saturday February 17 2007 03:01, Quinn Comendant wrote: How about an extensive statistics reporting tool, ..., that can show how well a current spamassassin installation is performing and where it needs improvements. Well, not exactly by your words, but in the same spirit, this time

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Graham Murray
Theo Van Dinter [EMAIL PROTECTED] writes: Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis so if you are unfortunate enough to be at the start of the spam

Re: Google Summer of Code 2007 ...

2007-02-17 Thread hamann . w
Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis so if you are unfortunate enough to be at the start of the spam run and receive multiple mails all with the same body then Razor, DCC and Pyzor might not help. Though if

Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last year that we could

Re: Google Summer of Code 2007 ...

2007-02-16 Thread C. Bensend
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? Perhaps this is trivial, or not desired by anyone else but myself, but I'd _love_ to be able to strip SpamAssassin tags via spamc

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Doc Schneider
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-16 Thread DAve
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? I believe this was once mentioned on Justin's blog (but can't find a ref now), the following sounds promising as an additional

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Mark Martinec writes: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? I believe this was once mentioned on Justin's blog (but can't find a ref now), the following sounds

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Dan
On Feb 16, 2007, at 7:35, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Justin Mason writes: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. Right. I felt a need for something like this when I was switching

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Chris St. Pierre
On Fri, 16 Feb 2007, Mark Martinec wrote: I believe this was once mentioned on Justin's blog (but can't find a ref now), the following sounds promising as an additional classifier to existing bayes (especially since the author comes from the same organization as myself :)

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Duncan Findlay
On Fri, Feb 16, 2007 at 09:31:13AM -0800, Dan wrote: On Feb 16, 2007, at 7:35, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? Also, any suggestions from outside the dev team? Anyone got good ideas

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Bart Schaefer
On 2/16/07, Justin Mason [EMAIL PROTECTED] wrote: Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3785

Re: Google Summer of Code 2007 ...

2007-02-16 Thread John D. Hardin
On Fri, 16 Feb 2007, Justin Mason wrote: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. +1 -- John Hardin KA7OHZ

Re: Google Summer of Code 2007 ...

2007-02-16 Thread John Rudd
John D. Hardin wrote: On Fri, 16 Feb 2007, Justin Mason wrote: Also, a related project would be to complete the pluginization of our Bayes engine and APIs, so that other probabilistic classifiers can be plugged in in place of, or in addition to, Bayes in SpamAssassin. +1 If that's a

RE: Google Summer of Code 2007 ...

2007-02-16 Thread Matthew Wilson
- Full, tested, supportable multithreaded support - Full, tested, supportable support for an asynchronous I/O model (a la qpsmtpd-async) - Pluggable to the point where all configuration and settings can be pulled from anywhere (databases, files, in-memory cache) at runtime, so SA could stay

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Raul Dias
On Fri, 2007-02-16 at 15:35 +, Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Also, any suggestions from outside the dev team? Anyone got good ideas for new SpamAssassin features that would be good to pay someone to work on for 3 months? Here's another one, to seize the opportunity when internal changes are being contemplated: Split the process into two parts: -

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
Matthew Wilson wrote: - Full, tested, supportable multithreaded support - Full, tested, supportable support for an asynchronous I/O model (a la qpsmtpd-async) I think effort could be better spent elsewhere. Spam checking lends itself ideally to running parallel individual processes, with

RE: Google Summer of Code 2007 ...

2007-02-16 Thread Matthew Wilson
-Original Message- From: Mark Martinec [mailto:[EMAIL PROTECTED] Sent: Friday, February 16, 2007 6:09 PM To: users@spamassassin.apache.org Subject: Re: Google Summer of Code 2007 ... Matthew Wilson wrote: - Full, tested, supportable multithreaded support - Full, tested

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Mark Martinec
On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O, and it's probably no big deal to provide about 2 GB of memory

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Justin Mason
Mark Martinec writes: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O, and it's probably no big deal to

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Quinn Comendant
On Fri, 16 Feb 2007 15:35:39 +, Justin Mason wrote: We still have a number of items from last year that we could use again. Anything else that we'd like people to code up? How about an extensive statistics reporting tool, possibly web-based, that can show how well a current spamassassin

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Raul Dias
On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully utilize today's CPU I/O,

Re: Google Summer of Code 2007 ...

2007-02-16 Thread Quinn Comendant
On Fri, 16 Feb 2007 18:01:37 -0800, Quinn Comendant wrote: And/or a fix for the qmail+simscan per-user preferences spamc -u issue where if an email is addressed to multiple users or an alias spamc isn't passed the correct user. Sorry to reply to myself, but I want to retract that last