On Wed, 2004-12-08 at 05:22, Alex Broens wrote: > Matthew Newton wrote: > > Hello, > > > > I've recently installed SA 3.0.1, and found some junk was > > getting through with scores too low for my liking, especially before the > > URLs made it into SURBL. I've put together a few rules to match some > > of these that you might find interesting. > > > > They are: > > > > Finally, a string of words (more than 15 here) that all begin with a > > capital letter, and no punctuation (I'm only testing this one at the > > moment, hence the low score): > > > > body UOLCC_CAPWORD_TEST /([A-Z][a-z]{3,}\s{1,2}){15,}/s > > describe UOLCC_CAPWORD_TEST String of words that all begin with caps letter > > score UOLCC_CAPWORD_TEST 0.1 > > > > > > Hope these are of use to someone. If anyone can show me that they are > > likely to pick up false positives, I'd be most grateful. > > This will likely trigger on several airline ticket confirmation messages > which, for some unknown highly scientific reason, are always sent all caps.
Getting off topic here, but the all caps is probably a holdover from the old SABRE airline reservation system which used a 6-bit codeset to reduce the transmission time on their (at the time) slow data links. -Bill