Re: [Assp-test] Suggestion for corpus file management

2009-09-12 Thread K Post
Oh! So with MaxBayesFileAge set to zero, move2num off, and subject logging on, it'll pare down, removing oldest first? The admin says: This option is useful, if UseSubjectsAsMaillogNames is set to on and doMove2Num is set to off, because in this case the number of files in every collection

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: If you want ASSP to delete files based on their age instead of their quantity ( MaxFiles ), setup MaxBayesFileAge to your needs. This is not correct.

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Hill, Brett
Fritz Borgstedt wrote: If you want ASSP to delete files based on their age instead of their quantity ( MaxFiles ), setup MaxBayesFileAge to your needs. This is not correct. Ok, then replace quantity with number. It's the same thing. I got this from version 1.5.1.7(0.1.08). If you've

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread K Post
Ah, right, thanks for the nudge. So, some questions on this concept, again considering that we're using subjectasmaillognames. 1) Wouldn't it be better to first remove files with the exact same subject instead of just going by their age, say leaving 2 or 3 for variance, but if there are 10 of the same, delete
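The duplicate-subject idea raised in this message can be sketched roughly as follows. This is only an illustration of the proposal, not how ASSP works: the function name `duplicates_to_delete`, the `(filename, subject, mtime)` tuple layout, and the case-insensitive subject comparison are all assumptions made for the example.

```python
from collections import defaultdict


def duplicates_to_delete(entries, keep_per_subject=3):
    """Pick files to delete so that each subject keeps only its newest copies.

    entries: iterable of (filename, subject, mtime) tuples (hypothetical layout).
    Returns a sorted list of filenames that exceed keep_per_subject.
    """
    by_subject = defaultdict(list)
    for name, subject, mtime in entries:
        # Assumption: subjects match if equal after trimming and lowercasing.
        by_subject[subject.strip().lower()].append((mtime, name))

    to_delete = []
    for group in by_subject.values():
        group.sort(reverse=True)  # newest first
        # Everything past the first keep_per_subject entries is surplus.
        to_delete.extend(name for _, name in group[keep_per_subject:])
    return sorted(to_delete)
```

A pass like this would run before any age-based or count-based pruning, so that a flood of identical subjects is thinned out while a few samples remain for variance.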

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
There are unlimited ways of doing the reducing. I cannot see an advantage (or disadvantage) in the way you do it. V1 has an additional age (default=1000 days) for reports. Maybe Thomas should add that to V2 too.

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: Fritz, I don't think the issue is HOW it's done. The issue (imho) is the wording in the descriptions, which can be confusing. I was replying to a proposal to do it differently from nntp.p...@gmail.com

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: For example, in v2, it states that you shouldn't delete files using age, even though that's the only way to do it if you use filenames and also don't use move2num. It does not state this. It states that you should not

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: However, in discussions with Thomas, it seems that you can use MaxAge with Bayesian, subjectfilenames enabled and move2num disabled. Is this statement correct? You can, but it is not recommended. The standard way works

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: The last sentence indicates to me that the number of files will only be pruned based on the age limit set by MaxBayesFileAge and that it won't delete the oldest files until the number of files per folder reaches MaxFiles. Yes?
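To make the two pruning criteria under discussion concrete, here is a minimal sketch of a folder cleanup that applies an age limit first and then trims the oldest files down to a count limit. It is not ASSP's code and makes no claim about how MaintBayesCollection actually behaves; `prune_collection`, its parameters, and the use of file modification time as "age" are assumptions for illustration, loosely mirroring MaxBayesFileAge and MaxFiles.

```python
import time
from pathlib import Path


def prune_collection(folder, max_files, max_age_days=0, now=None):
    """Delete over-age files, then the oldest files beyond max_files.

    max_age_days=0 disables the age check (assumed convention).
    Returns the names of deleted files, in deletion order.
    """
    now = time.time() if now is None else now
    # Oldest first, judged by modification time.
    files = sorted((p for p in Path(folder).iterdir() if p.is_file()),
                   key=lambda p: p.stat().st_mtime)

    deleted, kept = [], []
    for p in files:
        age_days = (now - p.stat().st_mtime) / 86400
        if max_age_days > 0 and age_days > max_age_days:
            p.unlink()
            deleted.append(p.name)
        else:
            kept.append(p)

    # Count limit: remove the oldest survivors until max_files remain.
    excess = max(len(kept) - max_files, 0)
    for p in kept[:excess]:
        p.unlink()
        deleted.append(p.name)
    return deleted
```

With `max_age_days=0` this reduces to a pure "oldest first until the count fits" policy, which is one reading of the behaviour asked about in this thread.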

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Steve Mallindine
Oops, I think I've missed something... the standard way... What standard way? If subjectfilenames is set to enabled, and move2num is disabled, what other way is there to make sure maxfiles is never exceeded? Cheers, Steve

Re: [Assp-test] Suggestion for corpus file management

2009-09-11 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: the standard way... What standard way? If subjectfilenames is set to enabled, and move2num is disabled, what other way is there to make sure maxfiles is never exceeded? If MaintBayesCollection is set, the default way (standard

[Assp-test] Suggestion for corpus file management

2009-09-10 Thread K Post
We use file name logging because on occasion I'm forced to go through the logs and report back to my supervisor any emails that I find to be misclassified. It's a huge waste of time I know - it is what it is. Because of this need, we can't use move2num, so the collection quickly grows beyond the

Re: [Assp-test] Suggestion for corpus file management

2009-09-10 Thread K Post
You're kidding!!! That's great news. I've read the options in the rebuildspamdb section several times over the years, but I don't see anything on housekeeping even in this new (to me) v2. I must be going dumb - can you kindly nudge me in the right direction? What text should I search for to

Re: [Assp-test] Suggestion for corpus file management

2009-09-10 Thread Fritz Borgstedt
ASSP development mailing list assp-test@lists.sourceforge.net wrote: can you kindly nudge me in the right direction? MaintBayesCollection : Maintenance for Bayesian Collection Set this to on, if you want ASSP to run a maintenance task on the Bayesian collection folders ( spamlog , notspamlog