Spyros Tsiolis wrote: > 3. Untarred the messages in such a way that I got hold of all > the messages from 12.00a.m. August 1st, 2008 to today > (October 24th, that is :-), with the help of a couple of > switches (thank you TAR !!) BTW, here are the final result > counts from the spam / ham files _after_ the filtering, that is : > > Spam items total : 7,827 (from August to now) Ham items > total : 6,480 (from August to now)
I would have tried to leave around 14,000 messages in each folder (you can still do that). That will give you a more comprehensive bayesian database. > 5. Started ASSP again and went to "Other Settings" -> "Max > Files" and set this to 18,000 (~8,000 + ~6,500 = 14,500) I think 14,000 is an appropriate number if you've got a mature collection. > 7. Stopped the ASSP process on the box You didn't need to do that, move2num will work fine with ASSP turned on. > Please let me know if everything is ok. Other than the above things, it sounds like you're on the right track! Kind Regards, Brett ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Assp-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-user
