I just had a thought.... I've been getting tons of spam over the last few months, that I still have in my email client's spam folder (just haven't gotten around to deleting it). Probably have over 30 or 40K spam msgs. Can I just take those and copy them all directly into the spam/ directory to help give the corpus something to start with instead of just an empty file? Similarly, can I take valid emails and just drop them all in the notspam/ directory?
Corpus emails don't necessarily need to go through ASSP to build up the corpus, do they? Is there any danger in my doing what I suggested? I am assuming that the rebuildspamdb.pl strips out all the headers (to/from/etc), so I shouldn't have to worry that all the emails would be addressed to or from a small subset of users? Thanks! Eric ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Assp-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-user
