On Thu, 11 Feb 2010 16:50:13 +0000 Kārlis Repsons <[email protected]> wrote:
> May I ask 2 more questions? > > With "TrainPristine on" it ought to become impossible to sort the > filtered mail by checking for some added spam header. > That is not true. Headers are added including a DSPAM signature BUT you are not able to retrain based on the signature because the signature will NOT be saved in dspam_signature_data. > But if its > "TrainPristine off", I wonder, how much of the incoming mails will > be kept in database? > As much as you want. If you run the included purge scripts then +/- mails from 15 days will be keept in the database. > And if dspam is trained only with $ dspam_train and when dspam > is offered a message from stdin, how will that serial number from > database be found and used to relearn the original message? > If you enable TrainPristine then the signature is IGNORED. You need to feed the whole message to DSPAM. btw: Why training with dspam_train? You know that you can train with dspam/dspamc if you want. > > > ATTACHED: > > README says: > DSPAM uses a signature-based system, where a serial number of > sorts is appended to each email processed by DSPAM. DSPAM reads this serial > number when the user forwards (or bounced) a message to what is called > their > "spam email address". The serial number points to temporary information > stored on the server (for 14 days by default) containing all of the > information necessary for DSPAM to relearn the message. This is necessary > in order to relearn the *exact* message DSPAM originally processed. > > If you are using an IMAP based system, Web-based email, or other form of > email management where the original messages are stored on the server in > pristine format, you can turn this signature feature off by setting > "TrainPristine on" in dspam.conf. DSPAM will then use the message itself > that you provide it to train, which MUST be identical to the original > message in order to retrain properly. > > dspam.conf says: > # TrainPristine: for systems where the original message remains server side > # and can therefore be presented in pristine format for retraining. This > option > # will cause DSPAM to cease all writing of signatures and DSPAM headers to > the > # message, and deliver the message in as pristine format as possible. This > mode > # REQUIRES that the original message in its pristine format (as of > delivery) > # be presented for retraining, as in the case of webmail, imap, or other > # applications where the message is actually kept server-side during > reading, > # and is preserved. DO NOT use this switch unless the original message can > be > # presented for retraining with the ORIGINAL HEADERS and NO MODIFICATIONS. > # > # NOTE: You can't use this setting with dspam_trian; if you're going to > use it, > # wait until after you train any corpora. > # > #TrainPristine on ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Dspam-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspam-user
