Vincenzo,

do you intend to also eliminate the duplicated substrings or does it
not significantly lower memory/cpu load?

 Bernd

On 8/30/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
Modified: 
james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java
         do {
+            if (!token.substring(0, end).equals(tokenLower.substring(0, end))) 
{
+                tokens.add(header + tokenLower.substring(0, end));
                 if (header.length() > 0) {
+                    tokens.add(tokenLower.substring(0, end));
                 }
             }
             if (end > 1 && token.charAt(0) >= 'A' && token.charAt(0) <= 'Z') {
end).toLowerCase());
+                tokens.add(header + token.charAt(0) + tokenLower.substring(1, 
end));
                 if (header.length() > 0) {
+                    tokens.add(token.charAt(0) + tokenLower.substring(1, end));

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to