Vincenzo,
do you also intend to eliminate the duplicated substrings, or would
that not significantly lower the memory/CPU load?
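
To illustrate what I mean, here is a rough sketch, not a patch. It assumes
the concern is the repeated substring(0, end) and substring(1, end) calls
visible in the diff below. The class name TokenVariants and the method
variants() are hypothetical and only wrap the two visible branches so the
snippet compiles on its own; the surrounding loop is not reproduced.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of BayesianAnalyzer: it mirrors the two
// branches visible in the diff, but computes each substring only once.
class TokenVariants {

    static List<String> variants(String header, String token, int end) {
        List<String> tokens = new ArrayList<>();

        // Hoisted values: each was recomputed several times per iteration.
        String tokenLower = token.toLowerCase();
        String prefix = token.substring(0, end);
        String prefixLower = tokenLower.substring(0, end);
        boolean hasHeader = header.length() > 0;

        // Add the all-lowercase variant only if it differs from the original.
        if (!prefix.equals(prefixLower)) {
            tokens.add(header + prefixLower);
            if (hasHeader) {
                tokens.add(prefixLower);
            }
        }

        // Add the "first letter capitalized" variant for tokens that start
        // with an ASCII uppercase letter.
        char first = token.charAt(0);
        if (end > 1 && first >= 'A' && first <= 'Z') {
            String capitalized = first + prefixLower.substring(1);
            tokens.add(header + capitalized);
            if (hasHeader) {
                tokens.add(capitalized);
            }
        }
        return tokens;
    }
}
```

Whether this is measurable in practice I cannot say without profiling; it
mainly saves a few short-lived String allocations per loop iteration.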
Bernd
On 8/30/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
Modified:
james/server/trunk/src/java/org/apache/james/util/BayesianAnalyzer.java
do {
+ if (!token.substring(0, end).equals(tokenLower.substring(0, end)))
{
+ tokens.add(header + tokenLower.substring(0, end));
if (header.length() > 0) {
+ tokens.add(tokenLower.substring(0, end));
}
}
if (end > 1 && token.charAt(0) >= 'A' && token.charAt(0) <= 'Z') {
end).toLowerCase());
+ tokens.add(header + token.charAt(0) + tokenLower.substring(1,
end));
if (header.length() > 0) {
+ tokens.add(token.charAt(0) + tokenLower.substring(1, end));