RE: repost: moving/adding bayes info to global DB

2006-12-14 Thread Karl Auer
On Thu, 2006-12-14 at 15:47 +0100, Giampaolo Tomassoni wrote:
> If you believe that each user gets more or less the same kind of
> e-mails (like, in example, when running a small-business MX), then you
> may think to switch to a per-system bayes db an preload that single db
> with the content of your own bayes.

Ok - how do I tell sa-learn to update the system database rather than
the DB under my home directory? I've read the sa-learn man page, and
there doesn't seem to be any appropriate switch.

One way (reading the man page for Mail::SpamAssassin::Conf) would be to
simply point the global SA bayes_path to my own ~/.spamassassin
directory...

Or I could create a special user, always run sa-learn as that user, and
point bayes_path to that user's .spamassassin directory (after
pre-loading the DB as you suggest).

Is there a Right Way?

The bayes stuff still seems to be used even when allow_user_rules is
false. On the other hand, bayes_path is one of the items that (according
to man Mail::SpamAssassin::Conf) cannot be set within a user_prefs file.
Which seems mean that the bayes DB location "~/.spamassassin" is
effectively hardcoded and immutable :-(

Regards, K. 

-- 
~~~
Karl Auer ([EMAIL PROTECTED])   +61-2-64957160 (h)
http://www.biplane.com.au/~kauer/  +61-428-957160 (mob)



RE: repost: moving/adding bayes info to global DB

2006-12-14 Thread Giampaolo Tomassoni
From: Karl Auer [mailto:[EMAIL PROTECTED]
> 
> For some time now, I have been busily accumulating bayes data by running
> sa-learn on various collections of emails. As myself, so I now have a
> nice big chunk o'data in ~/.spamassassin.
> 
> Since I am a newbie to SA, I didn't realise what was happening for some
> time. I actually wanted that data to be used globally, for all mails
> that spamassassin checks.

Since you are using per-user databases, there is no easy way to make that 
corpus available to every other SA user in your system.

You may use sa-learn --backup and --restore facilities to copy all that 
knowledge to someone else's account, but this would wipe the previous contents 
of the destinating bayes db, which may be less than optimal.

Due to how the bayes db is designed, you can't even "stack-up" the informations 
it contains in a multi-layered way, like, in example, by having a server-wide 
db and a per-user db which are inspected and updated in parallel: there is 
actually no way to merge data coming from multiple dbs as well as there is no 
way to update it.

If you believe that each user gets more or less the same kind of e-mails (like, 
in example, when running a small-business MX), then you may think to switch to 
a per-system bayes db an preload that single db with the content of your own 
bayes.

giampaolo


> Is there some simple way to do this? I no longer have the email that I
> used to train spamassassin, just ~/.spamassassin/bayes_seen and
> ~/.spamassassin/bayes_toks.
> 
> Regards, K.
> 
> -- 
> ~~~
> Karl Auer ([EMAIL PROTECTED])   +61-2-64957160 (h)
> http://www.biplane.com.au/~kauer/  +61-428-957160 (mob)
> 



Re: repost: moving/adding bayes info to global DB

2006-12-14 Thread Ian
On 15 Dec 2006 at 1:21, Karl Auer wrote:

> Hi there.
> 
> Just reposting a question to which I have as yet received no answer, in
> the hope that someone can assist...
> 
> Regards, K.

Hi,

I think the best way to do this would be to export the data from your exisiting 
bayes and 
then import it into the one you want.  

sa-learn --dbpath  --backup > sa_bayes_backup.txt
sa-learn --dbpath  --restore sa_bayes_backup.txt


watch out for the path to each command, I've got the feeling that if your bayes 
are like 
this:

/root/bayes_seen
/root/bayes_tokens ...

then the path will be /root/bayes , but I'm not completely sure.  Maybe someone 
else can 
jump in at this point...

Anyway, for more info on the sa-learn command use 'man sa-learn'

Regards

Ian
-- 



IMPORTANT:
This email is intended for the use of the individual addressee (s) 
named above and may contain information that is confidential, 
privileged or unsuitable for overly sensitive persons with low self-
esteem, no sense of humour or irrational religious beliefs. If you 
are not the intended recipient, any dissemination, distribution or 
copying of this email is not authorised (either explicitly or 
implicitly) and constitutes an Irritating social faux pas. Unless the 
word absquatulation has been used in its correct context somewhere 
other than in this warning, it does not have any legal or grammatical 
use and may be ignored. No animals were harmed in the transmission of 
this email, although the cat next door is living on borrowed time, 
let me tell you. Those of you with an overwhelming fear of the 
unknown will be gratified to learn that there is no hidden message 
revealed by reading this warning backwards, so just ignore that Alert 
Notice from Microsoft. However, by pouring a complete circle of salt 
around yourself and your computer you can ensure that no harm befalls 
you and your pets. If you have received this email in error, please 
place it in a warm oven for 40 minutes and add some nutmeg and egg 
whites. Whisk briefly and let it stand for 2 hours before icing.




repost: moving/adding bayes info to global DB

2006-12-14 Thread Karl Auer
Hi there.

Just reposting a question to which I have as yet received no answer, in
the hope that someone can assist...

Regards, K.

~~~ Forwarded Message ~~~
From: Karl Auer <[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
To: users@spamassassin.apache.org
Subject: moving/adding bayes info to global DB
Date: Sun, 10 Dec 2006 09:12:17 +1100

Hi there.

For some time now, I have been busily accumulating bayes data by running
sa-learn on various collections of emails. As myself, so I now have a
nice big chunk o'data in ~/.spamassassin.

Since I am a newbie to SA, I didn't realise what was happening for some
time. I actually wanted that data to be used globally, for all mails
that spamassassin checks.

Is there some simple way to do this? I no longer have the email that I
used to train spamassassin, just ~/.spamassassin/bayes_seen and
~/.spamassassin/bayes_toks.

Regards, K.

-- 
~~~
Karl Auer ([EMAIL PROTECTED])   +61-2-64957160 (h)
http://www.biplane.com.au/~kauer/  +61-428-957160 (mob)