On 01/20/16 10:36, John Hardin wrote:
> On Wed, 20 Jan 2016, Marc Perkel wrote:
> So it still needs to be trained, at least initially, with a
> manually-vetted corpus. If not, how do you propose to do the initial
> classification of messages for training?
> Do you envision it being self-training past that point? What if it
> goes off the rails? How would you keep it from going off the rails?
> If it's not self-training then you have the same issues with the
> reliability of the people feeding the training corpus.
On my system I have a long list of good email sources that are 100%
whitelisted, and I also have hackerbot traps that are 100% spam. I use
these for training to keep it on the rails. Good question though.
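The idea of self-training only from messages with certain provenance could be sketched as follows. The function and set names here (select_training_label, the whitelist and trap sets) are hypothetical illustrations, not the actual junkemailfilter.com implementation:

```python
def select_training_label(sender, recipient, whitelist, traps):
    """Return 'ham', 'spam', or None (don't train) for a message.

    Only messages whose origin is certain are fed back into training,
    which is what keeps a self-training filter "on the rails".
    """
    if sender in whitelist:   # 100% trusted good sources -> train as ham
        return "ham"
    if recipient in traps:    # hackerbot/spamtrap hit -> train as spam
        return "spam"
    return None               # uncertain provenance: never self-train on it
```

Anything that is neither whitelisted nor a trap hit is classified normally but never used for training, so a misclassification can't feed back into the model.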
>> So I'm not just tokenizing the subject; I'm also tokenizing the
>> first 25 words of the message.
> OK, good. I was thinking it would be *really* sensitive to "Bayes
> poisoning". Ignoring all but the first part of the body helps.
> I assume you're only considering the portion that would render as
> visible to the recipient. Of course, that brings in all the logic
> regarding "what is visible to the recipient?" and all the HTML
> obfuscation we're already seeing to get around Bayes and "only scan
> the first part of the message".
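The "subject plus first 25 body words" tokenization could be sketched as below. This is a minimal illustration assuming simple word splitting; the real filter's handling of HTML and of what actually renders as visible is not shown:

```python
import re

def tokenize(subject, body, n_body_words=25):
    """Lowercase word tokens from the subject plus the first
    n_body_words words of the body; the rest of the body is ignored."""
    subject_words = re.findall(r"[a-z0-9']+", subject.lower())
    body_words = re.findall(r"[a-z0-9']+", body.lower())
    return subject_words + body_words[:n_body_words]
```

Because everything past the first 25 body words is discarded, random word salad appended to the end of a message never reaches the classifier.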
Actually it's very insensitive to poisoning. Yes, a spammer might cancel
out some good phrases every now and then, but since my system does NOT
do matching on one side it's not as sensitive as Bayes. If they poison
with the same phrases twice I have them.
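One way to read the phrase-matching claim is sketched below, under the assumption that only word pairs seen exclusively in one training corpus count as evidence, with shared pairs ignored. This is an interpretation for illustration, not the actual junkemailfilter.com implementation:

```python
def classify(tokens, ham_phrases, spam_phrases):
    """Vote using bigrams (word pairs) unique to exactly one corpus.

    A bigram that appears in both corpora carries no evidence, so a
    spammer injecting common "good" words gains little, and any poison
    phrase they reuse soon lands in the spam corpus itself.
    """
    bigrams = set(zip(tokens, tokens[1:]))
    ham_hits = len(bigrams & (ham_phrases - spam_phrases))
    spam_hits = len(bigrams & (spam_phrases - ham_phrases))
    if spam_hits > ham_hits:
        return "spam"
    if ham_hits > spam_hits:
        return "ham"
    return "unknown"
```

Unlike per-token Bayes, isolated good words don't subtract from the spam score here; they only matter if they form whole word pairs already seen in ham and never in spam.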
--
Marc Perkel - Sales/Support
supp...@junkemailfilter.com
http://www.junkemailfilter.com
Junk Email Filter dot com
415-992-3400