On Sat, 19 Feb 2011, Warren Togami Jr. wrote:

> On 2/19/2011 6:43 AM, John Hardin wrote:
> > On Sat, 19 Feb 2011, Justin Mason wrote:
> > > On Friday, February 18, 2011, Warren Togami Jr. <[email protected]> wrote:
> > > > Is there any way our corpora can be part of SOUGHT's safety net
> > > > without giving up our privacy?
> > >
> > > Unfortunately not --- the generation process makes the entire mail
> > > contents fully visible.
> >
> > Perhaps yes, though. The generation process could check the masscheck
> > results of the SOUGHT subrules and permanently suppress any subrules
> > that hit (a certain threshold of) ham.
> >
> > It's feedback, just not as immediate as the dedicated SOUGHT ham corpus,
> > and it doesn't require exposing the masscheck ham corpora. I think
> > that's what Warren had in mind.
>
> Aren't the nightly masscheck subrules combinations of many patterns, so
> you aren't sure exactly which pattern is bad?

I apologize, I was unclear in my suggestion.

This is what I was referring to:

http://ruleqa.spamassassin.org/?rule=%2F__SEEK

Each __SEEK subrule is for a specific pattern. The SOUGHT rules are metas of several __SEEK rules.

As an example, here's a pattern subrule that might be too FP-prone and would be a candidate for being automatically suppressed after the masscheck results were analyzed:

http://ruleqa.spamassassin.org/20110219-r1072269-n/__SEEK_FMJXND/detail
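
A minimal sketch of that automatic suppression step, under stated assumptions: the input is simplified to one list of hit rule names per ham message (not the real masscheck log format), and the function name, the `__SEEK_` prefix filter, and the threshold value are all hypothetical choices for illustration.

```python
from collections import Counter


def subrules_to_suppress(ham_hits_per_message, prefix="__SEEK_", max_ham_rate=0.0005):
    """Return the subrule names whose ham hit rate exceeds max_ham_rate.

    ham_hits_per_message: iterable of rule-name lists, one list per ham
    message (an assumed, simplified stand-in for parsed masscheck results).
    """
    total = 0
    hits = Counter()
    for rules in ham_hits_per_message:
        total += 1
        # Count each subrule at most once per message.
        hits.update({r for r in rules if r.startswith(prefix)})
    if total == 0:
        return set()
    return {name for name, n in hits.items() if n / total > max_ham_rate}
```

The returned names would then be fed back to the generation process as a permanent exclusion list; how that list is stored and applied is left open here.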

> What I'm suggesting is a mini-masscheck for SOUGHT only. Put each
> pattern into thousands of individual sub-rules. Participants run this
> mini-masscheck against their ham only, then upload the logs directly
> to JM.

Ah, okay. We weren't thinking along the same lines, then.
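
For illustration, the "one sub-rule per pattern" idea could be sketched as below. This is only a sketch under assumptions: patterns are treated as literal ASCII strings, and the function name and the `__MINI_SEEK_` naming scheme are hypothetical. The output uses SpamAssassin's `body NAME /regex/` rule syntax.

```python
import re


def mini_masscheck_rules(patterns, prefix="__MINI_SEEK_"):
    """Emit one SpamAssassin body subrule line per literal pattern."""
    lines = []
    for i, pattern in enumerate(patterns, start=1):
        name = f"{prefix}{i:06d}"
        # Backslash-escape every non-word character so the pattern matches
        # literally and the "/" delimiter and "#" cannot break the rule line.
        regex = re.sub(r"([^A-Za-z0-9_])", r"\\\1", pattern)
        lines.append(f"body {name} /{regex}/")
    return lines
```

Because every pattern gets its own rule name, a ham hit in a participant's log points at exactly one pattern.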

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 [email protected]    FALaholic #11174     pgpk -a [email protected]
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  My sidearm is a piece of emergency equipment. It absolutely must
  be reliable, not "smart".
-----------------------------------------------------------------------
 3 days until George Washington's 279th Birthday
