maxAllowedDups has a very very low priority in assp, because maintening a
large amount of files may consum a large amount of system resources.
The spam folder will be maintened completely (and correctly) at startup
and at the end of the 'MaintBayesCollection' task.
you'll see a logline like
Mar-27-18 10:53:06 [Worker_10000] Info: MaxAllowedDups - 20,823 files
registered in spam folder - 0 files moved to folder C:/assp/discarded
If the same (or similar) subject is received multiple times within a short
time frame (some minutes), it may happen, that all these mails are
collected. If the same subject is received again some times later, while
the workload is low enought, the files for this subject will be maintened
by the SMTP worker. If the same subject is never seen again in a received
mail, the duplicate files (filenames) will be corrected with the next
regular maintenance task (startup/MaintBayesCollection).
The logline above (at startup) shows, that this feature is working like
expected. There were no unmaintened subjects at the last shutdown.
Having maxAllowedDups ignored for a specific subject at a point in time is
normal. Over a long time range, this feature works like expected.
Thomas
Von: "K Post" <nntp.p...@gmail.com>
An: "ASSP development mailing list" <assp-test@lists.sourceforge.net>
Datum: 27.03.2018 18:29
Betreff: Re: [Assp-test] maxAllowedDups not working when
punctuation is in subject???
Here's one that contains no punctuation that I have 50+ times in the
corpus
Subject: Shark Tank Loves This New Diet Product
Of the ones that didn't have punctuation that I spot checked, they all
were sent to an address that matched a Collect Address. Are collect
addresses exempt from the duplicate file name limit?
Here's an example of one with punctuation that is there many times and is
not sent to collect addresses
Subject: Ringing Ears? Eat This for Breakfast & Destroy Tinnitus
Fast.
Anyone else seeing anything like this?
On Tue, Mar 27, 2018 at 12:13 PM, K Post <nntp.p...@gmail.com> wrote:
I use subject names to store messages.
maxSubjectLength: 0
maxAllowedDups: 3
It's been set this exact way for years and generally seems to work.
However, having just completed going through the 15,000 spam messages in
/spam (that was fun, I see that some) subjects are repeated many many
times.
Most of these more then 3 duplicates (but I don't think all) seem to have
punctuation in the subject, and extra colon, ends in an exclamation mark,
a period, etc. Those punctuation marks are (rightfully) ignored in the
file name, but might they be used in the comparison so they're not coming
up as already there? I really don't know, I just know what I'm seeing.
Thanks
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test
DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally
privileged and protected in law and are intended solely for the use of the
individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no
known virus in this email!
*******************************************************
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test