Hi,
Some time ago we put together the following rule on this list to identify
mail that contains an HTTP URI and has 200 characters or fewer in the body:
uri __HAS_HTTP_URI m~^https?://~
rawbody __KB_RAWBODY_200 /^.{0,200}$/s
meta LOC_SHORT (__HAS_HTTP_URI && __KB_RAWBODY_200)
score LOC_SHORT 0.6
describe LOC_SHORT Has URI and short body
I'm finding that it hits on mail whose body is much longer than 200
characters, and I don't understand why. Is the rawbody rule only matched
against the text/plain part of the message? Here's an example:
http://pastebin.com/raw.php?i=XNHjxfTz
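One guess on my part, which I have not confirmed: if rawbody rules are run
against each line or chunk of the body rather than against the whole body at
once, then /^.{0,200}$/s would match as soon as any single piece is short.
The Python sketch below only illustrates that assumption. The function names
and the sample text are made up, and it does not reproduce what SpamAssassin
actually does internally.

```python
import re

# Same pattern as __KB_RAWBODY_200; Perl's /s modifier corresponds to re.DOTALL.
SHORT = re.compile(r"^.{0,200}$", re.DOTALL)


def hits_whole_body(body: str) -> bool:
    """Apply the pattern once to the entire body (what the rule intends)."""
    return SHORT.search(body) is not None


def hits_any_line(body: str) -> bool:
    """Apply the pattern to each line separately (the assumed behaviour)."""
    return any(SHORT.search(line) for line in body.splitlines())


# Hypothetical message body: 50 short lines, well over 200 characters in total.
long_body = "\n".join(["a short line of text"] * 50)

print(len(long_body))
print(hits_whole_body(long_body))
print(hits_any_line(long_body))
```

If that is what is happening, the whole-body check fails for a long message
while the per-line check succeeds, which would match the behaviour I'm seeing.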
Any ideas greatly appreciated.
Thanks,
Alex