On Sat, 27 Apr 2013, Alex wrote:

Hi,

To feed "ham" to bayes, should one only user mis-flagged mail, or may one
use unflagged (below 5) mail?

Expressed differently, can one feed "good" messages, "sa-learn --ham
path-to-ham " as one might feed missed spam, "sa-learn --spam path-to-spam"


You can train hams that have scored high (i.e. misclassified hams) and you
can proactively train low-scoring mail to try to avoid problems in the
first place.

If there are some spam messages with BAYES_00, and the database needs to be
corrected, is it best to just learn it as spam, or use --forget, then
--spam?

I just grepped the quarantine and there were a handful of BAYES_00 with
overall scores between 6 and 10.

Just re-learn it as spam, that automatically forgets that it was ham.

--forget is only useful to completely remove that message from the database.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Any time law enforcement becomes a revenue center, the system
  becomes corrupt.
-----------------------------------------------------------------------
 331 days since the first successful private support mission to ISS (SpaceX)

Reply via email to