Stevan Bajić wrote:
> On Thu, 17 Dec 2009 18:28:58 +0100
> Frantisek Hanzlik <[email protected]> wrote:
>
>> I want to upgrade several DSPAM installations, all of which use the hash
>> driver, to 3.9.0. Are there any suggestions? Is it possible to use the old
>> databases, or is that not recommended?
>>
> You can use old databases without issues.
>
>> Maybe, because of the different (better) charset decoding (important for
>> me, as Czech mail uses the utf8, 8859-2, cp1250, ... encodings) and HTML
>> parsing in 3.9.0, it would be better to throw away the old databases and
>> create new ones, probably using corpus training?
>>
> Since you are using the Hash driver any training you would want to do
> can only be on a per user basis since the Hash driver does not have
> DSPAM-groups support.
Hello Stevan,

how should I understand this ("Hash driver does not have DSPAM-groups support")? The README says that the hash driver does not support merged groups, but the other group types are probably OK, yes? In my configurations I mainly use "shared,managed" or "shared" groups and they work fine. Or is it not possible to use the dspam_train script for DSPAM pre-training? Also, the dspam sources contain a scripts/train.pl script; what is it for?

>
> I would say that you should keep the old databases and run the clean
> process (cssclean/csscompress) daily to purge old tokens from the database.
> Sooner or later the old unused tokens will vanish from the database and you
> will only have new tokens.
>
> As soon as you use 3.9.0 your users will benefit from the different (better)
> charset decoding and HTML parsing. Purging/removing the database will not
> affect that capability in either a negative or a positive way.
>

Well, I understand. I wanted to try pre-training DSPAM from a prepared spam and ham corpus, as I expect slightly better accuracy, in addition to starting with a CSS database built entirely by 3.9.0. This matters especially for lazy users who do not train DSPAM properly.

Sorry for my terrible English.

Thanks,
Franta Hanzlík

_______________________________________________
Dspam-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspam-user
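
The daily clean process suggested above (cssclean/csscompress) could be scripted roughly as follows and run from cron. This is only a sketch under some assumptions: that the per-user hash databases are *.css files somewhere below the DSPAM home directory, that cssclean and csscompress are on the PATH and each take the database filename as their single argument, and that /var/dspam is the DSPAM home (a hypothetical path, adjust to your installation). The function names are made up for illustration.

```python
import subprocess
from pathlib import Path

# Hypothetical DSPAM home; adjust to the local installation.
DSPAM_HOME = Path("/var/dspam")


def find_css_files(root):
    """Return all *.css hash databases below root, in sorted order."""
    return sorted(p for p in Path(root).rglob("*.css") if p.is_file())


def maintenance_commands(css_file):
    """Build the two maintenance commands for one database file.

    Assumption: both tools are invoked as '<tool> <filename>'.
    """
    return [["cssclean", str(css_file)], ["csscompress", str(css_file)]]


def run_maintenance(root, dry_run=True):
    """Clean and compress every database below root.

    With dry_run=True nothing is executed; the planned commands are
    only collected and returned so they can be inspected first.
    """
    planned = []
    for css in find_css_files(root):
        for cmd in maintenance_commands(css):
            planned.append(cmd)
            if not dry_run:
                # check=True stops the run on the first failing command.
                subprocess.run(cmd, check=True)
    return planned
```

Calling run_maintenance(DSPAM_HOME) first with the default dry_run=True lists what would be run; only pass dry_run=False once the list looks right. It is probably wise to run this while mail delivery is quiet, since the tools rewrite the database files.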
