Yesterday I saw a spam filter insert a "PROBABLY-SPAM" warning into the subject line of an innocent message. It also appended what looked like detailed reasoning to the body of the message: a list of UNDERSCORE_SEPARATED heuristic names, each accompanied by a score. It also said that the spam threshold was an overall score of 5; apparently that overall score was the sum of the individual scores.

Well, the message scored 2.6 on a DEAR_SOMETHING heuristic, and ended up classified as "spam" (5.3) once the scores of several other incomprehensible mail-header heuristics were added in. Is it really that bad to start a message with "Dear Something"? And even if it is, is it really more typical of spam than of legitimate mail?
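
For what it's worth, here is a rough sketch of how I assume the arithmetic works. Only the DEAR_SOMETHING score (2.6), the total (5.3) and the threshold (5) come from the message itself. The name OTHER_HEADER_RULES is a made-up placeholder lumping together the remaining heuristics, and the strictly-greater-than comparison is my guess, not something the filter told me.

```python
# Sketch of additive spam scoring, as inferred from the report in the message.
# Assumption: the verdict is "spam" when the summed score exceeds the threshold.
SPAM_THRESHOLD = 5.0


def classify(scores, threshold=SPAM_THRESHOLD):
    """Return (total score, is_spam) for a dict of heuristic name -> score."""
    total = sum(scores.values())
    return total, total > threshold


scores = {
    "DEAR_SOMETHING": 2.6,      # score reported in the message
    "OTHER_HEADER_RULES": 2.7,  # hypothetical: all the other heuristics combined
}

total, is_spam = classify(scores)
print(f"{total:.1f}", "spam" if is_spam else "not spam")
```

Under these assumptions, removing DEAR_SOMETHING alone would leave 2.7, well under the threshold, so that single greeting is what tips the message over.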

Needless to say, the non-technical receiver of the message didn't understand any of the COMPUTER_GENERATED rubbish at the bottom.
