On Wed, 2004-12-08 at 05:22, Alex Broens wrote:
> Matthew Newton wrote:
> > Hello,
> > 
> > I've recently installed SA 3.0.1, and found some junk was
> > getting through with scores too low for my liking, especially before the
> > URLs made it into SURBL. I've put together a few rules to match some
> > of these that you might find interesting.
> > 
> > They are:
> > 
> > Finally, a string of words (more than 15 here) that all begin with a
> > capital letter, and no punctuation (I'm only testing this one at the
> > moment, hence the low score):
> > 
> > body      UOLCC_CAPWORD_TEST /([A-Z][a-z]{3,}\s{1,2}){15,}/s
> > describe  UOLCC_CAPWORD_TEST String of words that all begin with caps letter
> > score     UOLCC_CAPWORD_TEST 0.1
> > 
> > 
> > Hope these are of use to someone. If anyone can show me that they are
> > likely to pick up false positives, I'd be most grateful.
> 
> This will likely trigger on several airline ticket confirmation messages 
> which, for some unknown highly scientific reason, are always sent all caps.

Getting off topic here, but the all caps is probably a holdover from 
the old SABRE airline reservation system which used a 6-bit codeset
to reduce the transmission time on their (at the time) slow data links.

        -Bill


Reply via email to