-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On 29/03/11 22:55, Stevan Bajić wrote:
> On Tue, 29 Mar 2011 22:40:30 +0200 Tom Hendrikx <[email protected]>
> wrote:
>
> On 29/03/11 20:44, Kenneth Marshall wrote:
>>>> On Tue, Mar 29, 2011 at 09:40:19PM +0300, Ibrahim Harrani
>>>> wrote:
>>>>> Hi Ken,
>>>>>
>>>>> I added the list to dspam.conf. Should/Can I delete from
>>>>> database those pattern as well?
>>>>>
>>>>> Thanks
>>>>>
>>>>
>>>> Unfortunately, only the hash of the header is stored so there
>>>> is not a way to identify the applicable tokens. You may want
>>>> to regenerate the corpus. If you create the table with a <100%
>>>> fillfactor, the training should go faster.
>>>>
>
> That is incorrect. You can easily recreate the hash with dspam_crc,
> f you know the original token:
>
> $ dspam_crc "Received*from+userid" TOKEN: 'Received*from+userid' CRC:
> 2977181303331328604
>
> vmail=# select * from dspam_token_data where token =
> '2977181303331328604'; uid | token | spam_hits |
> innocent_hits | last_hit
> -----+---------------------+-----------+---------------+------------
> 1 | 2977181303331328604 | 4 | 2203 | 2011-03-29
>
>
>> That is right however one needs to know as well how the tokenizer
>> has assembled those tokens. For the dull tokenizers (aka word and
>> chain) things are easy but as soon as you enter the more
>> intelligent tokenizers (aka osb and sbph) then things can quicky
>> get complicated.
>
That is true, but the OP had questions about tokens that appeared in his
logfile, these are ready to use with dspam_crc.
I agree that you should not try to manually reconstruct the tokens from
original data: I created my example copying a tokn from the
X-DSPAM-Factors mail header.
Regards,
Tom
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk2SShsACgkQeEmCqmj6IjY5UwCghfCfnj+E8DTQ54d2TWftQyfq
9EoAn3ZuKM6bvTl4/1xjNawLki+9FId4
=+Rg7
-----END PGP SIGNATURE-----
------------------------------------------------------------------------------
Enable your software for Intel(R) Active Management Technology to meet the
growing manageability and security demands of your customers. Businesses
are taking advantage of Intel(R) vPro (TM) technology - will your software
be a part of the solution? Download the Intel(R) Manageability Checker
today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________
Dspam-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspam-user