> There's 107 characters between largest and quickest
>
> I shortened my original test to:
>
> This is simulating a spammer email. We have the largest selection of
products
> available anywhere in the world!  We provide the quickest delivery.
>
>
> Why order anywhere else? There is no one better, or faster than us.
>
> It doesn't trip the rule.

"body" isn't always what you might think it is.  In a pure text message the
body will probably come close to being the full message body, if there
aren't any URLs in the message.  In an HTML message like a spammer would
send, the full message body is split into multiple individual 'body' hunks,
generally along the lines of <p> delimited.

So, for instance, a body test on the example above might fail if it needed a
hit in both paragraphs.  Because they might be separate bodies.

This "feature" of SA (and it is a feature, I reported it as a bug and had it
soundly rejected) makes any counting algorithm VERY iffy, at best.  Unless
all of your hits are going to be in a single paragraph, with no intervening
urls, it probably won't work in a significant percentage of the available
cases.

        Loren

Reply via email to