On Monday 22 August 2016 at 16:45:09, Dianne Skoll wrote:
> On Mon, 22 Aug 2016 07:34:00 -0700 Marc Perkel wrote:
> > > So. What percentage of emails using your algorithm are actually
> > > decidable?
> >
> > Almost 100% if you look at a wide variety of tokens from multiple
> > attributes.
>
> I can't believe that, or I'm missing something. Almost every spam I see
> contains words that also appear in ham. Things like "this" or "invoice"
> or "regards" or "dear".
>
> What am I missing?

I believe you're missing Marc's definition of "token".

Antony.

--
Anyone that's normal doesn't really achieve much.
- Mark Blair, Australian rocket engineer
Please reply to the list;
please *don't* CC me.