On Thu, 1 Oct 2009, jdow wrote:
From: "John Hardin" <jhar...@impsec.org>
Yours may still hit .cn in the path part. May I suggest:
m;^https?://[^/?]+\.cn\b;
Regardless of their correctness, would you care to expound on the success
of these two rules, John? I like what works not political correctness.
I think these are two interesting observations. Of course, they won't
work very well for somebody doing business with China or embedded within
the .cn TLD.
"what works" is based on the accuracy of the corpora. If the corpora show
lots of spam with .cn TLD URIs and little or no ham with such, then that
rule will hit often, and have a good S/O, and get a high score.
I too am surprised that .cn TLDs appear in 51% of the spam corpus but I
haven't looked into it in any detail. I can certainly check it against my
own corpora and see if it's reasonable - but then again, I don't do any
business with anyone in china, and I _do_ get a fair amount of bulk emails
from manufacturers in china purportedly looking for business partners.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
If "healthcare is a Right" means that the government is obligated
to provide the people with hospitals, physicians, treatments and
medications at low or no cost, then the right to free speech means
the government is obligated to provide the people with printing
presses and public address systems, the right to freedom of
religion means the government is obligated to build churches for the
people, and the right to keep and bear arms means the government is
obligated to provide the people with guns, all at low or no cost.
-----------------------------------------------------------------------
Approximately 9052800 firearms legally purchased in the U.S. this year