On Thu, 11 Feb 2010 16:50:13 +0000
Kārlis Repsons <[email protected]> wrote:

> May I ask 2 more questions?
> 
> With "TrainPristine on" it ought to become impossible to sort the 
> filtered mail by checking for some added spam header.
>
That is not true. Headers are added including a DSPAM signature BUT you are not 
able to retrain based on the signature because the signature will NOT be saved 
in dspam_signature_data.


> But if its 
> "TrainPristine off", I wonder, how much of the incoming mails will 
> be kept in database?
>
As much as you want. If you run the included purge scripts then +/- mails from 
15 days will be keept in the database.


> And if dspam is trained only with $ dspam_train and when dspam
> is offered a message from stdin, how will that serial number from
> database be found and used to relearn the original message?
> 
If you enable TrainPristine then the signature is IGNORED. You need to feed the 
whole message to DSPAM.

btw: Why training with dspam_train? You know that you can train with 
dspam/dspamc if you want.

> 
> 
> ATTACHED:
> 
> README says:
>    DSPAM uses a signature-based system, where a serial number of
>    sorts is appended to each email processed by DSPAM. DSPAM reads this serial
>    number when the user forwards (or bounced) a message to what is called 
> their
>    "spam email address". The serial number points to temporary information
>    stored on the server (for 14 days by default) containing all of the 
>    information necessary for DSPAM to relearn the message. This is necessary
>    in order to relearn the *exact* message DSPAM originally processed.
> 
>      If you are using an IMAP based system, Web-based email, or other form of
>      email management where the original messages are stored on the server in
>      pristine format, you can turn this signature feature off by setting
>      "TrainPristine on" in dspam.conf. DSPAM will then use the message itself
>      that you provide it to train, which MUST be identical to the original
>      message in order to retrain properly.
> 
> dspam.conf says:
>    # TrainPristine: for systems where the original message remains server side
>    # and can therefore be presented in pristine format for retraining. This 
> option
>    # will cause DSPAM to cease all writing of signatures and DSPAM headers to 
> the
>    # message, and deliver the message in as pristine format as possible. This 
> mode
>    # REQUIRES that the original message in its pristine format (as of 
> delivery)
>    # be presented for retraining, as in the case of webmail, imap, or other
>    # applications where the message is actually kept server-side during 
> reading,
>    # and is preserved. DO NOT use this switch unless the original message can 
> be
>    # presented for retraining with the ORIGINAL HEADERS and NO MODIFICATIONS.
>    #
>    # NOTE: You can't use this setting with dspam_trian; if you're going to 
> use it,
>    #       wait until after you train any corpora.
>    #
>    #TrainPristine on

------------------------------------------------------------------------------
SOLARIS 10 is the OS for Data Centers - provides features such as DTrace,
Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW
http://p.sf.net/sfu/solaris-dev2dev
_______________________________________________
Dspam-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to