How hard is it to keep list mail on list and not reply directly to sender?

Have you seen
https://svn.apache.org/repos/asf/spamassassin/trunk/contrib/HOWTO.Bayes-Redis/ ?

there may be some helpful info in there.

On 4/16/21 9:47 AM, Christian Völker wrote:
Thanks for the hint. I will monitor it. The machine has 16GB of memory which should be sufficient but I already notivce the preallocation of redis with 2GB.

It is somehow unclear what happens. If there is no limit I will get an OOM errror and redis will (if killed) loose the last transactions after the last "save 900 1" snapshot, right?

If I set a limit it will discard the oldest entries, correct?

Both seems not to be perfect for Spamassassin.

However, I will ignore the topic for the moment and see how it goes. 16GB shoud (hopefully) be enough. Once scanned the expired rules of Spamassassin should take place and reduce the amount of memory.

Greetings

/Christian




Am 16.04.2021 um 09:15 schrieb Axb:
To avoid suprises, remember to watch your memory usage.
Redis reads/writes the DB in memory and only dumps to disk for backup.

"redis-cli info" is of help


On 4/16/21 9:10 AM, Christian Völker wrote:
Sorry to annoy you. Another addition to my tests:

When using redis it took me around 15seconds to scan ~1,500 messages.
When using MariaDB it took one minute to do the same.
With file based I had strange issues whatever lock type eI used (flock yes/no): "bayes: bayes db version 0 is not able to be used, aborting! at /usr/share/perl5/Mail/SpamAssassin/BayesStore/DBM.pm line 206."


Anyways, now using Redis which appears to be the fastest.

Thanks again!

/Christian



Am 16.04.2021 um 08:48 schrieb Christian Völker:
Hi,
So I will re-configure my installation to use MariaDB.
You should also consider the Redis backend.

Ok, had a look when using MariaDB and I monitored it for the last 24hrs. My 10 vCPUs where used, no I/O waits. But CPU usage overall was according to "top" only at 25% as top showed 75% idle. I assume there is some locking in place limiting the CPU usage.

I configured it now to use Redis instead of MySQL and top tells me about 25% idle with 0% I/O waits when running 10 sa-learn in parallel. Increasing or decreasing the number of jobs does not significally change the idle percentage.

So using redis the CPU usage is higher compared to MySQL.

Thanks for ideas!

/Christian







Reply via email to