Re: [Dspam-user] spam_train does not work

Jehan Pagès Thu, 05 Feb 2009 15:40:14 -0800

Hi,

On Thu, Feb 5, 2009 at 9:56 PM, Steve <[email protected]> wrote:


> btw: I do only SC/NC training over here. For that I have a modified
> training script doing the SC/NC training and doing TONE (train on or near
> error). The result is that training now takes 1/2 to 1/3 of the normal
> training time and the data is way, way, way smaller than with normal
> training. Just the other week I helped another Gentoo user with training of
> his DSPAM installation. Result was that he got down from 483.33M of data
> (normal training) down to 47.79M data. The training was slightly faster but
> not so significant as on my setup (keep in mind that I have a significantly
> rewritten storage driver for MySQL on one of my setups).
> Checking accuracy on a corpus he has never trained before resulted in 99.x%
> and on another corpus it resulted in 98.x%. I am not 100% sure about the
> accuracy since I can not find his mail with the real numbers but I remember
> one to be 99.9 something and the other > 98.5 something. Anyway... the
> accuracy was pretty high considering that he never has trained mails from
> that corpus and considering that it was a very messy and error-prone corpus
> (spamarchive.org submit and autosubmit).
>

Thanks for your offer; this looks very interesting and I will probably come
back to it soon. I just have to deal with another configuration issue first
(see below).


>
> > But the training output looks much more interesting now! Not all messages
> > are passing as non-spam as it did before (I had 0 true positive, now I
> > have 478 of them!).
> >
> > So things look better, thanks very much! I will now wait a day or so to
> > check that some spams are really caught and sent to the Junk/ folder of my
> > mailbox as expected in my dspam configuration. If this works, I will be
> > happy and report it here.
>

So all was nearly good. Now dspam is trained (it could be better, as you
said, but it is enough for my configuration tests) and it is tagging a lot
of my spam, with [SPAM] in the subject and headers like these:

X-DSPAM-Result: Spam
X-DSPAM-Processed: Thu Feb  5 23:49:02 2009
X-DSPAM-Confidence: 0.5232
X-DSPAM-Probability: 0.9869
X-DSPAM-Signature: 498b6cde61901804284693
X-DSPAM-Factors: 27,
        Subject*watch, 0.99000,
        a+loved, 0.99000,
        Subject*for+all, 0.99000,
...
...
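
To check quickly which verdict dspam attached to a given message, the headers above can be read with a small script. This is only a sketch using Python's standard email module; the function name dspam_verdict and the sample message are made up for illustration, and only the header names come from the output above.

```python
from email import message_from_string


def dspam_verdict(raw_message: str) -> dict:
    """Return the DSPAM result, probability and confidence of one message."""
    msg = message_from_string(raw_message)
    return {
        "result": msg.get("X-DSPAM-Result"),
        # Missing headers become NaN so the caller can tell them apart.
        "probability": float(msg.get("X-DSPAM-Probability", "nan")),
        "confidence": float(msg.get("X-DSPAM-Confidence", "nan")),
    }


# Hypothetical sample built from the header values shown above.
sample = (
    "Subject: [SPAM] watch\n"
    "X-DSPAM-Result: Spam\n"
    "X-DSPAM-Confidence: 0.5232\n"
    "X-DSPAM-Probability: 0.9869\n"
    "\n"
    "body\n"
)

print(dspam_verdict(sample))
```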

But I wanted spam to go into my Junk folder (using maildir, this is the
directory .maildir/.Junk in my configuration).
As I copied in a previous email, I set my quarantine agent like this:

QuarantineAgent "/usr/libexec/dovecot/deliver -d %u -m Junk"

But it does not seem to work, as my emails are still delivered to INBOX.
I would like to check which agent is used for spam and whether the
QuarantineAgent is indeed called. Depending on this, I will be able to
see whether my issue is in dspam or in dovecot.
Unfortunately I have not managed to get logs telling me this.
I tried the following, but it did not show me anything interesting:

Debug *
DebugOpt process spam fp

I also tried to add a shell command to the QuarantineAgent ("echo
spam > /tmp/spam; /usr/libexec/dovecot/deliver -d %u -m Junk"),
but apparently this value is not processed by a shell, hence it failed.
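
Since the QuarantineAgent value is apparently not run through a shell, one thing that could be tried is to point it at a single wrapper executable that writes a log line and then hands over to deliver. The following is only a sketch under assumptions: the log path /tmp/dspam-quarantine.log is arbitrary, the wrapper's install location is hypothetical, and I have not verified how dspam passes the message to the agent (the sketch assumes standard input, which os.execv leaves untouched).

```python
#!/usr/bin/env python3
import os
import sys
import time

DELIVER = "/usr/libexec/dovecot/deliver"  # path from the QuarantineAgent line
LOG_FILE = "/tmp/dspam-quarantine.log"    # hypothetical log location


def build_command(user: str) -> list:
    """Build the same deliver command that QuarantineAgent was set to run."""
    return [DELIVER, "-d", user, "-m", "Junk"]


def log_call(user: str, log_file: str = LOG_FILE) -> None:
    """Append one timestamped line so each invocation leaves a trace."""
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    with open(log_file, "a") as log:
        log.write(f"{stamp} quarantine agent called for {user}\n")


def run(user: str) -> None:
    log_call(user)
    cmd = build_command(user)
    # Replace this process with deliver; stdin (the message) is inherited.
    os.execv(cmd[0], cmd)


if __name__ == "__main__":
    if len(sys.argv) == 2:
        run(sys.argv[1])
    else:
        print("usage: wrapper <user>", file=sys.stderr)
```

The agent line would then name only the wrapper followed by %u, with the wrapper saved wherever is convenient and made executable. If the log file stays empty after a message tagged as spam arrives, the agent is not being called at all and the problem would be on the dspam side; if lines do appear, the deliver call is the next thing to look at.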

Would you know how I could diagnose this?
Thanks.

Jehan
