Odhiambo Washington wrote:
* On 26/01/07 17:41 +0800, Kent Tong wrote:
| Hi,
|
| I'm pilot testing dspam and is training it. It detects spam in English
| or Russian quite well, but it almost always fails to detects spam in
| Chinese. I've fed it with about 2,000 ham in my mail box and corrected
| about 200 missed spams (false positive), but it doesn't seem to be
| improving.
|
| Does anyone have good experience with Chinese spam?
How much spam (esp chinese) have you trained it with?
Below is the stats:
dspam_stats -H [EMAIL PROTECTED]
[EMAIL PROTECTED]:
TP True Positives: 648
TN True Negatives: 290
FP False Positives: 0
FN False Negatives: 202
SC Spam Corpusfed: 89
NC Nonspam Corpusfed: 1359
TL Training Left: 851
SHR Spam Hit Rate 76.24%
HSR Ham Strike Rate: 0.00%
OCA Overall Accuracy: 82.28%
Among those false negatives, at least 50% are Chinese. So, at least 100
Chinese spam have been fed to dspam as errors.
--
Kent Tong
Post questions on our IT support forum (http//www2.cpttm.org.mo/forum).
Responses are guaranteed in 3 working days.