[VOTE] Release JAMES 2.3.0

2006-08-31 Thread Noel J. Bergman
I am still wanting to wait a few days to be sure, but in the meantime, we might as well vote in anticipation that there is no further problem. Since JAMES v2.3.0RC2, there have been the following changes: 1) UPGRADE.txt was added 2) default.properties changed to 2.3.0 from 2.3.0rc2 3) phoenix.

RE: Check for valid RCPT before accept email

2006-08-31 Thread Noel J. Bergman
> > Does it make sense to track how many valid & invalid addresses are > > provided for a message, and allow a threshold to consider that spam? > Hmm we could do so and reject such a message after the data. > any other thoughts ? Well, according to RFC 2821, 354 is not the only valid reply code t

Re: [jira] Created: (JAMES-602) Add feature to import and export BayesianAnalyzer data

2006-08-31 Thread Norman Maurer
I did the work here but im not happy with the import. It take ages to import the data. The export takes only a few second. I used the Xstream libary cause it provide an easy way. Any hints ? Am Donnerstag, den 31.08.2006, 14:47 -0700 schrieb Norman Maurer (JIRA): > Add feature to import and exp

[jira] Created: (JAMES-602) Add feature to import and export BayesianAnalyzer data

2006-08-31 Thread Norman Maurer (JIRA)
Add feature to import and export BayesianAnalyzer data --- Key: JAMES-602 URL: http://issues.apache.org/jira/browse/JAMES-602 Project: James Issue Type: New Feature Reporter: N

re: Check for valid RCPT before accept email

2006-08-31 Thread Norman Maurer
Hmm we could do so and reject such a message after the data. any other thoughts ? bye Norman Am Mittwoch, den 30.08.2006, 13:35 -0400 schrieb Noel J. Bergman: > Does it make sense to track how many valid & invalid addresses are provided > for a message, and allow a threshold to consider that sp

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Vincenzo Gianferrari Pini
I was wrong, the ratio is "only" 1 to 1174 :-) . But I think substring() is still not so relevant... ;-) . Seriously, FYI, the reason for such high ratio is that substring() works at string level, while toLowerCase() breaks down to a character level loop, and in my profiling (using a Postage "

svn commit: r438957 - /james/server/trunk/src/conf/sqlResources.xml

2006-08-31 Thread norman
Author: norman Date: Thu Aug 31 08:49:49 2006 New Revision: 438957 URL: http://svn.apache.org/viewvc?rev=438957&view=rev Log: Add support for derby to use with BayesianAnalzer. Thx to Guillermo Grandes. See JAMES-555 Modified: james/server/trunk/src/conf/sqlResources.xml Modified: james/ser

svn commit: r438900 - /james/mime4j/trunk/bin/generate-test-data.sh

2006-08-31 Thread bago
Author: bago Date: Thu Aug 31 05:54:43 2006 New Revision: 438900 URL: http://svn.apache.org/viewvc?rev=438900&view=rev Log: Updated testmsgs folder reference in test script. Modified: james/mime4j/trunk/bin/generate-test-data.sh Modified: james/mime4j/trunk/bin/generate-test-data.sh URL: ht

svn commit: r438888 - /james/mime4j/trunk/examples/

2006-08-31 Thread bago
Author: bago Date: Thu Aug 31 05:23:08 2006 New Revision: 43 URL: http://svn.apache.org/viewvc?rev=43&view=rev Log: Copying maven2-port to trunk... Added: james/mime4j/trunk/examples/ - copied from r438886, james/mime4j/branches/maven2-port/examples/ --

svn commit: r438886 - /james/mime4j/trunk/src/

2006-08-31 Thread bago
Author: bago Date: Thu Aug 31 05:22:35 2006 New Revision: 438886 URL: http://svn.apache.org/viewvc?rev=438886&view=rev Log: Copying maven2-port to trunk... Added: james/mime4j/trunk/src/ - copied from r438885, james/mime4j/branches/maven2-port/src/

svn commit: r438885 - /james/mime4j/trunk/pom.xml

2006-08-31 Thread bago
Author: bago Date: Thu Aug 31 05:22:11 2006 New Revision: 438885 URL: http://svn.apache.org/viewvc?rev=438885&view=rev Log: Copying maven2-port to trunk... Added: james/mime4j/trunk/pom.xml - copied unchanged from r438884, james/mime4j/branches/maven2-port/pom.xml

svn commit: r438884 - in /james/mime4j/trunk: buildparser_address.bat buildparser_contenttype.bat buildparser_date.bat lib/ maven.xml project.properties project.xml samples/ src/ testmsgs/ xdocs/

2006-08-31 Thread bago
Author: bago Date: Thu Aug 31 05:21:01 2006 New Revision: 438884 URL: http://svn.apache.org/viewvc?rev=438884&view=rev Log: Removed trunk folders/files to leave space to the maven2-port merge Removed: james/mime4j/trunk/buildparser_address.bat james/mime4j/trunk/buildparser_contenttype.ba

Re: [jira] Closed: (JAMES-514) Add Mailet that support jason forspamfiltering

2006-08-31 Thread Vincenzo Gianferrari Pini
Just found the following in Gary Robinson's blog (http://radio.weblogs.com/0101454/stories/2002/09/16/spamDetection.html): Note 2: In calculating p(w) Graham counts every instance of word w in every email in which it appears. Obviously, if a word appears once in an email, there is a greater p

[jira] Assigned: (JAMES-555) Bayesiananalysis should support not only mysql and msql

2006-08-31 Thread Norman Maurer (JIRA)
[ http://issues.apache.org/jira/browse/JAMES-555?page=all ] Norman Maurer reassigned JAMES-555: --- Assignee: Norman Maurer > Bayesiananalysis should support not only mysql and msql > --- > >

Re: [jira] Commented: (JAMES-555) Bayesiananalysis should support not only mysql and msql

2006-08-31 Thread Norman Maurer
Thx for that "contribution" i will add it to trunk soon. bye Norman Am Donnerstag, den 31.08.2006, 03:24 -0700 schrieb Guillermo Grandes (JIRA): > [ > http://issues.apache.org/jira/browse/JAMES-555?page=comments#action_12431854 > ] > > Guillermo Grandes commented on JAMES-555:

Re: [jira] Closed: (JAMES-514) Add Mailet that support jason forspamfiltering

2006-08-31 Thread Vincenzo Gianferrari Pini
Norman Maurer wrote: Am Mittwoch, den 30.08.2006, 16:08 +0200 schrieb Vincenzo Gianferrari Pini: To fully Support Gary's proposal we should collect a third field in the bayesiananalysis_ham and spam tables, containing his different counter, and the corpuses should be rebuilt from scratch.

[jira] Commented: (JAMES-555) Bayesiananalysis should support not only mysql and msql

2006-08-31 Thread Guillermo Grandes (JIRA)
[ http://issues.apache.org/jira/browse/JAMES-555?page=comments#action_12431854 ] Guillermo Grandes commented on JAMES-555: - It forgot it, goes in conf/sqlResources.xml in section: > Bayesiananalysis should support not only mysql and

[jira] Commented: (JAMES-555) Bayesiananalysis should support not only mysql and msql

2006-08-31 Thread Guillermo Grandes (JIRA)
[ http://issues.apache.org/jira/browse/JAMES-555?page=comments#action_12431853 ] Guillermo Grandes commented on JAMES-555: - Support for embedded Derby and impatient people who use James 2.3 :-) CREATE TABLE bayesiananal

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Bernd Fondermann
wow. :-) ok, fine. Bernd On 8/31/06, Vincenzo Gianferrari Pini <[EMAIL PROTECTED]> wrote: The substring cpu time is not relevant compared to toLowerCase: 1 to 6 ratio :-) Let's keep it as is. Vincenzo Stefano Bagnara wrote: > Maybe he's referring to "tokenLower.substring(0, end)". > Th

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Vincenzo Gianferrari Pini
The substring cpu time is not relevant compared to toLowerCase: 1 to 6 ratio :-) Let's keep it as is. Vincenzo Stefano Bagnara wrote: Maybe he's referring to "tokenLower.substring(0, end)". This appears twice in your code and could be moved to a local variable. Stefano Vincenzo Gianfer

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Bernd Fondermann
Hi Vincenzo, I am simply referring to the tokenLower.substring(0, end) and tokenLower.substring(1, end) expressions in the patch, which from my view are computed two times if the if-conditions hold, once in the condition evaluation and once again in the body. This could count up. Not sure if it

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Stefano Bagnara
Maybe he's referring to "tokenLower.substring(0, end)". This appears twice in your code and could be moved to a local variable. Stefano Vincenzo Gianferrari Pini wrote: Bernd, I don't understand what you mean by "duplicated substrings". If you mean the substrings added to the tokens ArrayList

Re: svn commit: r438517 - /james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java

2006-08-31 Thread Vincenzo Gianferrari Pini
Bernd, I don't understand what you mean by "duplicated substrings". If you mean the substrings added to the tokens ArrayList, only the most significant of them (highest "probability strength") is later on kept by the calling method (getTokenProbabilityStrengths). This is the way it is expecte

Re: Configuration API

2006-08-31 Thread Norman Maurer
Am Mittwoch, den 30.08.2006, 19:54 +0200 schrieb Bernd Fondermann: > On 8/30/06, Noel J. Bergman <[EMAIL PROTECTED]> wrote: > > > > So I would move to use Jakarta Commons Configuration for JAMES, rather than > > the Avalon configuration, for both JAMES and the Mailet API (caveat: Danny > > suggests