Am Freitag, 25. Februar 2005 02:29 schrieb Robert Menschel: > Hello Thomas, > > Thursday, February 24, 2005, 1:51:53 AM, you wrote: > >> SA has now re-created everything; only, now I have no > >> bayes_journal any more at all: > > TA> Why didn't you try sa-learn --force-expire > TA> This should reduce your journal and bayes_toks. > TA> Now you must train your bayes again with ham and spam. > > My impression was that force-expire would reduce the size of > bayes_toks and bayes_seen, but not bayes_journal -- files in > bayes_journal are waiting to go into the database, and if that file > had grown as large as Mark indicated, then there was some problem > preventing bayes_journal from being written to the other two files. > > That was my understanding, based on what I've read here and seen in > operation, without looking at any code. Corrections are welcome. > > Bob Menschel
I have not looking in the code either, but executing --force-expire I get the
following result. ( My journal is a bit small because I used the command 2
dayes ago for the first time. So I have not so much expirience with this
command).
[EMAIL PROTECTED]:~/.spamassassin> ll
insgesamt 6376
-rw------- 1 thomas users 176128 2005-02-25 10:58 auto-whitelist
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.dir
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.pag
-rw------- 1 thomas users 25152 2005-02-25 10:58 bayes_journal
-rw------- 1 thomas users 2617344 2005-02-25 10:58 bayes_seen
-rw------- 1 thomas users 5341184 2005-02-25 10:58 bayes_toks
-rw-r--r-- 1 thomas users 1269 2005-02-18 05:49 user_prefs
[EMAIL PROTECTED]:~/.spamassassin> sa-learn --force-expire
synced Bayes databases from journal in 0 seconds: 455 unique entries (1048
total entries)
expired old Bayes database entries in 54 seconds
134373 entries kept, 7753 deleted
token frequency: 1-occurence tokens: 61.66%
token frequency: less than 8 occurrences: 23.19%
[EMAIL PROTECTED]:~/.spamassassin> ll
insgesamt 6304
-rw------- 1 thomas users 176128 2005-02-25 11:07 auto-whitelist
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.dir
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.pag
-rw------- 1 thomas users 16512 2005-02-25 11:07 bayes_journal
-rw------- 1 thomas users 2617344 2005-02-25 11:07 bayes_seen
-rw------- 1 thomas users 5369856 2005-02-25 11:07 bayes_toks
-rw-r--r-- 1 thomas users 1269 2005-02-18 05:49 user_prefs
A immediately, sa-learn --sync give this result (I didn't stop spamd/spamc so
there may be messages spamchecked in the mean time)
[EMAIL PROTECTED]:~/.spamassassin> sa-learn --sync
synced Bayes databases from journal in 0 seconds: 509 unique entries (688
totalentries)
[EMAIL PROTECTED]:~/.spamassassin> ll
insgesamt 6284
-rw------- 1 thomas users 176128 2005-02-25 11:07 auto-whitelist
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.dir
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.pag
-rw------- 1 thomas users 2617344 2005-02-25 11:14 bayes_seen
-rw------- 1 thomas users 5369856 2005-02-25 11:14 bayes_toks
-rw-r--r-- 1 thomas users 1269 2005-02-18 05:49 user_prefs
and after a few minutes:
[EMAIL PROTECTED]:~/.spamassassin> ll
insgesamt 6288
-rw------- 1 thomas users 176128 2005-02-25 11:17 auto-whitelist
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.dir
-rw------- 2 thomas users 12355 2005-02-18 05:18 auto-whitelist.pag
-rw------- 1 thomas users 1152 2005-02-25 11:17 bayes_journal
-rw------- 1 thomas users 2617344 2005-02-25 11:17 bayes_seen
-rw------- 1 thomas users 5369856 2005-02-25 11:17 bayes_toks
-rw-r--r-- 1 thomas users 1269 2005-02-18 05:49 user_prefs
As you see after the second run the journal size is 0.
Maybe --sync is the better one for shrinking the journal.
Regards
Thomas
--
icq:133073900
http://www.t-arend.de
pgpzzCZXxTqfQ.pgp
Description: PGP signature
