Re: per-user bayes

2020-12-09 Thread Benny Pedersen
hg user skrev den 2020-12-09 08:57: I believe that a SA plugin (like bayes) is able to know the envelope MAIL FROM and RCPT TO values... is it correct? If it is possible we "just" have to modify the bayes plugin provide this patch first and ask later :=) bayes does not fokus o

Re: per-user bayes

2020-12-09 Thread Dean Carpenter
user identifier. > Fair warning - if you go full per-user on a large system, this will MASSIVELY > balloon the size of your Bayes database, and most users will idle below the > learning thresholds for quite a long time. > Can you give an idea of the size calculation? I'm wantin

Re: per-user bayes

2020-12-09 Thread deano-spamassassin
> a large system, this will MASSIVELY balloon the size of your Bayes database, > and most users will idle below the learning thresholds for quite a long time. > Can you give an idea of the size calculation? I'm wanting to do this, but I > need to figure out how much space I need to

Re: per-user bayes

2020-12-09 Thread @lbutlr
loon the size of your >> Bayes database, and most users will idle below the learning thresholds >> for quite a long time. > Can you give an idea of the size calculation? I'm wanting to do this, > but I need to figure out how much space I need to allocate per user! That woul

Re: per-user bayes

2020-12-08 Thread hg user
I believe that a SA plugin (like bayes) is able to know the envelope MAIL FROM and RCPT TO values... is it correct? If it is possible we "just" have to modify the bayes plugin On Tue, Dec 8, 2020 at 10:13 PM Benny Pedersen wrote: > micah anderson skrev den 2020-12-08 21:54: &g

Re: per-user bayes

2020-12-08 Thread Benny Pedersen
micah anderson skrev den 2020-12-08 21:54: Kris Deugau writes: There will only be one database and set of tables, but one of the fields in each table is the user identifier. Fair warning - if you go full per-user on a large system, this will MASSIVELY balloon the size of your Bayes

Re: per-user bayes

2020-12-08 Thread micah anderson
Kris Deugau writes: > There will only be one database and set of tables, but one of the fields > in each table is the user identifier. Fair warning - if you go full > per-user on a large system, this will MASSIVELY balloon the size of your > Bayes database, and most users wil

Re: per-user bayes

2020-12-08 Thread Kris Deugau
Benoit Panizzon wrote: Hi This may help <http://svn.apache.org/repos/asf/spamassassin/branches/duncf_masses/sql/README.bayes> I sort of have the same issue. Unfortunately that does not help, it merely explains how to store bayes data in a database. But there is still only one '

Re: per-user bayes

2020-12-08 Thread @lbutlr
On 08 Dec 2020, at 08:36, Benoit Panizzon wrote: > Adding the list back to CC as I believe this is an interesting topic > many have pondered over. Forgot to fix the reply to on this list for some reason. Fixed now. > Yes, I see that is states 'per user' but I still don't see, how that

Re: per-user bayes

2020-12-08 Thread Benoit Panizzon
Hi Adding the list back to CC as I believe this is an interesting topic many have pondered over. Yes, I see that is states 'per user' but I still don't see, how that 'bayes user' is being set on a per recipient base. On the email platform there is ONE config file for spamassassin. So if I set

Re: per-user bayes

2020-12-08 Thread Benoit Panizzon
Hi > This may help > > <http://svn.apache.org/repos/asf/spamassassin/branches/duncf_masses/sql/README.bayes> I sort of have the same issue. Unfortunately that does not help, it merely explains how to store bayes data in a database. But there is still only one 'global' databa

Re: per-user bayes

2020-12-08 Thread @lbutlr
On 07 Dec 2020, at 13:56, micah anderson wrote: > A per-user setup would let each user do their own thing, but I don't see > how I can do that because our system doesn't have individual system > users and I don't see that there are options in the bayes sql > configuration or per

per-user bayes

2020-12-07 Thread micah anderson
Hi all, I've got a site-wide bayes mysql setup. It keeps getting poisoned quickly, because the user patterns are far too divergent from each other. One person's spam is another person's ham, nobody is happy. A per-user setup would let each user do their own thing, but I don't see how I can do

Re: bayes and InnoDB read locks

2020-11-30 Thread Benny Pedersen
Alex skrev den 2020-11-30 17:33: 2020-11-30 10:11:46 2772099 [Note] InnoDB: *** (2) WAITING FOR THIS LOCK TO BE GRANTED: have you tryed do the bayes with AriaDB ? note i am not expert at all

bayes and InnoDB read locks

2020-11-30 Thread Alex
Hi, I recently set up a central database server to store bayes data for a handful of mail relays to query for bayes info. I've done this in the past and don't recall there being a problem with read locks, but hoped someone could explain why I'm now seeing errors/warnings like this when using sa

Re: Why does sa-compile access the bayes db?

2020-05-29 Thread Matus UHLAR - fantomas
On 2020-05-28 15:32, Bert Van de Poel wrote: Almost all of the email we process are forwarders. It doesn't really make sense for us to do a non-global bayes db. The large majority of email we process is also for a uniform group: student organizations at our local university. On 28.05.20 21:05

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Bert Van de Poel
Oh, I had misunderstood you, Matus. My bad! I thought you meant we should use a separate bayes db for every mailbox user, but now I understand you were referring to the amavis user which indeed runs everything. I just moved the existing bayes db (after stopping amavis of course

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Benny Pedersen
On 2020-05-28 15:32, Bert Van de Poel wrote: Almost all of the email we process are forwarders. It doesn't really make sense for us to do a non-global bayes db. The large majority of email we process is also for a uniform group: student organizations at our local university. does not matter

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Benny Pedersen
On 2020-05-28 15:22, Matus UHLAR - fantomas wrote: On 28.05.20 13:38, Bert Van de Poel wrote: We're using a global bayes_path defined in local.cf: This is your problem imho. if you use amavis, you need no bayes database, but amavis users', i guess in /var/lib/amavis/.spamassassin

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Benny Pedersen
On 2020-05-28 10:18, Matus UHLAR - fantomas wrote: I wonder where did these files come from. did you sety bayes_path in /etc/spamassassin/ ? setup userprefs file for amavisd, in this file make sure bayes data keep in amavisd user, not the spamassasin user where there is no write access

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Matus UHLAR - fantomas
On 28.05.20 15:32, Bert Van de Poel wrote: Almost all of the email we process are forwarders. It doesn't really make sense for us to do a non-global bayes db. The large majority of email we process is also for a uniform group: student organizations at our local university. you have

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Bert Van de Poel
Almost all of the email we process are forwarders. It doesn't really make sense for us to do a non-global bayes db. The large majority of email we process is also for a uniform group: student organizations at our local university. On 28/05/2020 15:22, Matus UHLAR - fantomas wrote

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Matus UHLAR - fantomas
On 28.05.20 13:38, Bert Van de Poel wrote: We're using a global bayes_path defined in local.cf: This is your problem imho. if you use amavis, you need no bayes database, but amavis users', i guess in /var/lib/amavis/.spamassassin/ On 28/05/2020 10:18, Matus UHLAR - fantomas wrote

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Bert Van de Poel
We're using a global bayes_path defined in local.cf: use_bayes 1 use_bayes_rules 1 bayes_auto_learn 1 bayes_expiry_max_db_size 150 bayes_path /var/lib/spamassassin/bayes_db/bayes bayes_file_mode 0775 bayes_ignore_to spam-analy...@ulyssis.org bayes_ignore_from spam-analy...@ulyssis.org

Re: Why does sa-compile access the bayes db?

2020-05-28 Thread Matus UHLAR - fantomas
of spam and ham we've received, then enabling autolearn to further train the set. As Spamassassin runs inside Amavis, the Bayes database files are owned by the amavis user. This setup works fine, and results for Bayes are great and growing in accuracy by autolearning. What was somewhat

Re: Why does sa-compile access the bayes db?

2020-05-27 Thread Bert Van de Poel
Plugin initialization+journal sync would make a lot of sense. What would be the cleanest solution in that case? It's quite annoying to receive the same error mail every day. Should we use --cnf to disable the bayes plugin, or is there a more elegant solution? Should we file a bug about

Re: Why does sa-compile access the bayes db?

2020-05-25 Thread RW
On Mon, 25 May 2020 23:34:27 +0200 Bert Van de Poel wrote: > My question therefore specifically is: what exactly does sa-compile > do to the bayes database files? I don't know for sure, but it's probably just a side-effect of initializing plugins. Possibly it's trying to p

Why does sa-compile access the bayes db?

2020-05-25 Thread Bert Van de Poel
of spam and ham we've received, then enabling autolearn to further train the set. As Spamassassin runs inside Amavis, the Bayes database files are owned by the amavis user. This setup works fine, and results for Bayes are great and growing in accuracy by autolearning. What was somewhat confusing

Re: Bayes files LOCK

2020-02-14 Thread RW
ites through the journal they are > > lockless. > > So, auto-training may be problematic w/r/t locking as well. I presume > that window is coded to be as small as possible. I wasn't referring auto-training, that's done at the end under locking, and picking-up count updates

Re: Bayes files LOCK

2020-02-14 Thread Pedro David Marco
On Friday, February 14, 2020, 7:46:18 PM GMT+1, John Hardin wrote: >> I was looking at it in a bit more detail and it looks like there isn't >> a reader-writer lock,  just write locks for the toks and seen >> files. As scans defer their writes through the journal they are >> lockless.

Re: Bayes files LOCK

2020-02-14 Thread John Hardin
On Fri, 14 Feb 2020, RW wrote: On Fri, 14 Feb 2020 14:38:05 + (UTC) Pedro David Marco wrote: On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW wrote: That would defeat the object of having a journal file.>>Even if you are right, it doesn't really explain anything because it>applies to

Re: Bayes files LOCK

2020-02-14 Thread RW
On Fri, 14 Feb 2020 14:38:05 + (UTC) Pedro David Marco wrote: > > > >On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW > > wrote: That would defeat the object of > >having a journal file.>>Even if you are right, it doesn't really > >explain anything because it>applies to everyone using 

Re: Bayes files LOCK

2020-02-14 Thread Pedro David Marco
>On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW > wrote: >That would defeat the object of having a journal file.>>Even if you are right, >it doesn't really explain anything because it>applies to everyone using  >BDB/DBM/SDBM. >>IIWY I'd be looking at what's different for you. I basically

Re: Bayes files LOCK

2020-02-14 Thread RW
On Thu, 13 Feb 2020 15:29:46 + (UTC) Pedro David Marco wrote: > >>On 13.02.20 12:30, RW wrote: > >>Bayes doesn't write on scans (unless it does an opportunistic sync > >>or > >expiry): > > > >doesn't it record token access times to journa

Re: Bayes files LOCK

2020-02-13 Thread RW
On Thu, 13 Feb 2020 15:05:57 +0100 Matus UHLAR - fantomas wrote: > >On Wed, 12 Feb 2020 18:03:01 + (UTC) > >Pedro David Marco wrote: > >> i am getting  errors from Byes because it is not able to lock Bayes > >> files... Error log is:  > >>   bayes

Re: Bayes files LOCK

2020-02-13 Thread Pedro David Marco
>On 13.02.20 12:30, RW wrote: >>Bayes doesn't write on scans (unless it does an opportunistic sync or >expiry): > >doesn't it record token access times to journal? i think SA always does an EXclusive lock despite the parameters, probably because of that, Fantomas I

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
On Wed, 12 Feb 2020 18:03:01 + (UTC) Pedro David Marco wrote: i am getting  errors from Byes because it is not able to lock Bayes files... Error log is:    bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call  SA tries to lock bayes

Re: Bayes files LOCK

2020-02-13 Thread RW
On Wed, 12 Feb 2020 18:03:01 + (UTC) Pedro David Marco wrote: > Hi.. > i am getting  errors from Byes because it is not able to lock Bayes > files... Error log is:  >   bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* > R/W: lock failed: Interrupted system c

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
should help too. and I forgot to add: storing in redis makes only sense for shared BAYES database. -- Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/ Warning: I wish NOT to receive e-mail advertising to this address. Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
On 12.02.20 19:02, Pedro David Marco wrote: Fully aligned with my suspicions... I love Redis...  i think someone posted sometime ago a performance table showing local / sql / redis  ... i will look for it... but it would not surprise me if remote Redis is even faster than local DB... last

Re: Bayes files LOCK

2020-02-12 Thread Axb
On 2020-02-13 04:11, John Hardin wrote: On Wed, 12 Feb 2020, Pedro David Marco wrote: Hi.. i am getting  errors from Byes because it is not able to lock Bayes files... Error log is:   bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system

Re: Bayes files LOCK

2020-02-12 Thread John Hardin
On Wed, 12 Feb 2020, Pedro David Marco wrote: Hi.. i am getting  errors from Byes because it is not able to lock Bayes files...  Error log is:    bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call  SA tries to lock bayes files always

Re: Bayes files LOCK

2020-02-12 Thread Benny Pedersen
Kevin A. McGrail skrev den 2020-02-12 23:14: I don't know for sure but Alex Broens aka axb is a master at Bayes with Redis. Definitely look for his postings. lets see sqlite with DBI: supported in spamassassin i am happy with postgresql

Re: Bayes files LOCK

2020-02-12 Thread Kevin A. McGrail
I don't know for sure but Alex Broens aka axb is a master at Bayes with Redis. Definitely look for his postings. -- Kevin A. McGrail Member, Apache Software Foundation Chair Emeritus Apache SpamAssassin Project https://www.linkedin.com/in/kmcgrail - 703.798.0171 On Wed, Feb 12, 2020 at 2:02 PM

Re: Bayes files LOCK

2020-02-12 Thread Pedro David Marco
. On Wednesday, February 12, 2020, 7:32:42 PM GMT+1, Matus UHLAR - fantomas wrote: On 12.02.20 18:03, Pedro David Marco wrote: >i am getting  errors from Byes because it is not able to lock Bayes files...  >Error log is:  >  bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_*

Re: Bayes files LOCK

2020-02-12 Thread Matus UHLAR - fantomas
On 12.02.20 18:03, Pedro David Marco wrote: i am getting  errors from Byes because it is not able to lock Bayes files...  Error log is:    bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call  SA tries to lock bayes files always

Bayes files LOCK

2020-02-12 Thread Pedro David Marco
Hi.. i am getting  errors from Byes because it is not able to lock Bayes files...  Error log is:    bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call  SA tries to lock bayes files always in "EXclusive mode", hence when a sa

Re: [Fuglu-users] fuglu spamassassin bayes users

2020-01-29 Thread Benny Pedersen
Bill Cole skrev den 2020-01-25 21:01: Answering that question requires a more carefully written and detailed problem description. sorry for that

Re: bayes sql postgresql

2020-01-26 Thread Marc Stürmer
Zitat von Benny Pedersen : i came after using it this way for long time now that it could be more optimized with bayes user id vars, currently it creates new ids each time there is a new user, but it does not reuse old ids that is not used anymore after sa-learn --username f...@example.org

Re: [Fuglu-users] fuglu spamassassin bayes users

2020-01-25 Thread Bill Cole
am just unsure if it can be solved in fuglu or it needs to be supported in spamassassin to have mail alias support ? another inconvence is that if outbound spamassassin is used, its still recipient bayes learned, with means its not usefull learned, i had hoped it was keeped to belearned from

Fwd: [Fuglu-users] fuglu spamassassin bayes users

2020-01-25 Thread Benny Pedersen
is that if outbound spamassassin is used, its still recipient bayes learned, with means its not usefull learned, i had hoped it was keeped to belearned from sender domain with is local, not learned to remote recipieents my question is how to solve it in fuglu if it can at all ?

Re: bayes sql postgresql

2020-01-18 Thread RW
On Sat, 18 Jan 2020 13:31:10 +0100 Benny Pedersen wrote: > i came after using it this way for long time now that it could be > more optimized with bayes user id vars, currently it creates new ids > each time there is a new user, but it does not reuse old ids that is > not used anym

bayes sql postgresql

2020-01-18 Thread Benny Pedersen
i came after using it this way for long time now that it could be more optimized with bayes user id vars, currently it creates new ids each time there is a new user, but it does not reuse old ids that is not used anymore after sa-learn --username f...@example.org, then that id is not used

Re: Switching to SQL for user preferences: will I need to transfer the bayes DB?

2020-01-08 Thread Bill Cole
On 8 Jan 2020, at 7:47, Guido Goluke, MajorLabel wrote: Will I need to move the Bayes DB to an 'SQL form' when I switch to SQL-style user preferences? No. The Bayes, user prefs, and TxRep (or AWL) databases each have their own independent configurations -- Bill Cole b...@scconsult.com

Switching to SQL for user preferences: will I need to transfer the bayes DB?

2020-01-08 Thread Guido Goluke, MajorLabel
My setup is that of only virtual e-mail users. The 'vmail' user handles all internal stuff. Right now, I have a cronjob on the vmail user that scans 'learn spam' folders on a couple of mailboxes so that the bayes DB is built up on the vmail user and the learnt false negatives are actually used

Re: Bayes

2019-11-29 Thread Jerry Malcolm
Can I bump this one to the top again?  I had great bayes reports in every email for 30 minutes.  Then nothing for the last three days, even after restarting SA.  Is it possible that my bayes db got corrupted?  The sa-learn --dump magic looks ok as far as I can tell. Thanks for any suggestions

Re: Can someone explain how to read Bayes stats?

2019-11-27 Thread @lbutlr
On 27 Nov 2019, at 06:52, Anders Gustafsson wrote: > 0.000 0 3184 0 non-token data: nspam > 0.000 0 17298 0 non-token data: nham Plenty of spam and ham learned > 0.000 0 1553643652 0 non-token data: oldest atime Oldest data is

Re: Can someone explain how to read Bayes stats?

2019-11-27 Thread Matus UHLAR - fantomas
On 27.11.19 15:52, Anders Gustafsson wrote: pamir:~ # sa-learn --dump magic 0.000 0 3 0 non-token data: bayes db version 0.000 0 3184 0 non-token data: nspam 0.000 0 17298 0 non-token data: nham 0.000 0

Can someone explain how to read Bayes stats?

2019-11-27 Thread Anders Gustafsson
Ie: pamir:~ # sa-learn --dump magic 0.000 0 3 0 non-token data: bayes db version 0.000 0 3184 0 non-token data: nspam 0.000 0 17298 0 non-token data: nham 0.000 0 164549 0 non-token data: ntokens

Re: Bayes

2019-11-26 Thread Jerry Malcolm
This is getting stranger by the minute... After playing around and verifying permissions and everything, I actually started getting a bayes score item for each email.  So I celebrated and went to dinner.  Came back a few hours later and checked the logs.  Bayes consistently added a score line

Re: Bayes

2019-11-26 Thread Jerry Malcolm
On 11/25/2019 3:02 PM, Mikael Syska wrote: Try and run: sa-learn --dump magic Should give you some information like: 0.000  0  3  0  non-token data: bayes db version 0.000  0 493422  0  non-token data: nspam 0.000  0    3867414  0  non

Re: Bayes

2019-11-25 Thread RW
On Mon, 25 Nov 2019 22:02:03 +0100 Mikael Syska wrote: > Try and run: > > sa-learn --dump magic That needs to be run as the spamd user (using su or otherwise) for db files.

Re: Bayes

2019-11-25 Thread Mikael Syska
Try and run: sa-learn --dump magic Should give you some information like: 0.000  0  3  0  non-token data: bayes db version 0.000  0 493422  0  non-token data: nspam 0.000  0    3867414  0  non-token data: nham 0.000  0

Re: Bayes

2019-11-24 Thread RW
On Sun, 24 Nov 2019 12:35:16 -0600 Jerry Malcolm wrote: > Thanks again to everyone who helped me get everything up and running > over the past couple of days. > > Now that I have SA finding my bayes database, I'm curious about bayes > reporting.  My bayes db is not new.  I migra

Bayes

2019-11-24 Thread Jerry Malcolm
Thanks again to everyone who helped me get everything up and running over the past couple of days. Now that I have SA finding my bayes database, I'm curious about bayes reporting.  My bayes db is not new.  I migrated my previous bayes db from my old installation.  And I've been training

Re: some question about bayes learn/score math

2019-11-22 Thread RW
On Fri, 22 Nov 2019 15:15:02 +0100 Philipp Ewald wrote: > Hi folks, > > sorry for asking again but Bayes is a blackbox for me :( > > I have some question about bayes math. How does bayes/spamassassin > calculate this values? > > i'm wonder about : auto-learn?*

some question about bayes learn/score math

2019-11-22 Thread Philipp Ewald
Hi folks, sorry for asking again but Bayes is a blackbox for me :( I have some question about bayes math. How does bayes/spamassassin calculate this values? i'm wonder about : auto-learn?* and "score so far=8.601) dbg: rules: running head tests; score so far=8.601 [...] dbg: rules: ru

Re: Question about Bayes implementation

2019-10-22 Thread RW
On Tue, 22 Oct 2019 12:21:45 +0200 Philipp Ewald wrote: > Hi folks, > > at this point i split all my SPAM mail to get the attachment to > create a hash table. (but this is not my point) > > Its also possible to split my SPAM into html/text, plain/text and > headers to. > Debian package:

Question about Bayes implementation

2019-10-22 Thread Philipp Ewald
Hi folks, at this point i split all my SPAM mail to get the attachment to create a hash table. (but this is not my point) Its also possible to split my SPAM into html/text, plain/text and headers to. Debian package: ripmime Now i ask myself: If i learn spamassassin with my mails should i

Re: Machine learning with or vs. Bayes?

2019-06-27 Thread Shreyansh Shrivastava.
On Fri, 28 Jun 2019, 07:42 Amir Caspi, wrote: > Hi all, > > I don't suppose anyone has a neural-net-based SA Machine Learning plugin > or external program, to complement or replace Bayes? There are a number of > fairly compact Python ML packages that would greatly ease this

Re: Machine learning with or vs. Bayes?

2019-06-27 Thread Olivier
> Of course, Gmail and the other big providers have their own ML solutions that > seem to be pretty good, though they have an enormous user base and > near-infinite resources... I would argue, in contrary, that Gmail performs rather poorly, I have at least one FP a day and that is a big no no.

Machine learning with or vs. Bayes?

2019-06-27 Thread Amir Caspi
Hi all, I don't suppose anyone has a neural-net-based SA Machine Learning plugin or external program, to complement or replace Bayes? There are a number of fairly compact Python ML packages that would greatly ease this task nowadays, like TensorFlow. It looks like rspamd has a neural net

Re: Is Bayes forgetting ?

2019-05-29 Thread Matus UHLAR - fantomas
the same as what? >Bayes db is on Linux. seems I wasn't clear at my question: How do you use spamassassin? milter, amavis, procmail filter, postfix filter ... ? -- Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/ Warning: I wish NOT to receive e-mail advertising to

Re: Is Bayes forgetting ?

2019-05-28 Thread Jim Dunphy
ote: >> On 27.05.19 12:51, hg user wrote: >> >the Linux user is the same. >> the same as what? >> >Bayes db is on Linux. >> seems I wasn't clear at my question: >> How do you use spamassassin? milter, amavis, procmail filter, postfix >>

Re: Is Bayes forgetting ?

2019-05-27 Thread Matus UHLAR - fantomas
On 27.05.19 15:15, hg user wrote: I think, well... I suppose to be doing everything according to zimbra requirements. does zimbra document how to run spamassassin test? I'm trying to undersand if it is possible that the bayes plugin reports different results within a few minutes

Re: Is Bayes forgetting ?

2019-05-27 Thread hg user
ncluded > 0.000 0 3 0 non-token data: bayes db version > 0.000 01042011 0 non-token data: nspam > 0.000 0 66472 0 non-token data: nham > 0.000 0 663479 0 non-token data: ntokens > 0.000

Re: Is Bayes forgetting ?

2019-05-27 Thread hg user
I think, well... I suppose to be doing everything according to zimbra requirements. I'm trying to undersand if it is possible that the bayes plugin reports different results within a few minutes and with no changes to the bayes db... Here are the commands, so that you may tell me if some

Re: Is Bayes forgetting ?

2019-05-27 Thread Matus UHLAR - fantomas
n Mon, May 27, 2019 at 1:18 PM Matus UHLAR - fantomas wrote: On 27.05.19 12:51, hg user wrote: >the Linux user is the same. the same as what? >Bayes db is on Linux. seems I wasn't clear at my question: How do you use spamassassin? milter, amavis, procmail filter, postfix filter ... ?

Re: Is Bayes forgetting ?

2019-05-27 Thread hg user
. On Mon, May 27, 2019 at 1:18 PM Matus UHLAR - fantomas wrote: > On 27.05.19 12:51, hg user wrote: > >the Linux user is the same. > > the same as what? > > >Bayes db is on Linux. > > seems I wasn't clear at my question: > How do you use spamassassin? milter,

Re: Is Bayes forgetting ?

2019-05-27 Thread Matus UHLAR - fantomas
On 27.05.19 12:51, hg user wrote: the Linux user is the same. the same as what? Bayes db is on Linux. seems I wasn't clear at my question: How do you use spamassassin? milter, amavis, procmail filter, postfix filter ... ? -- Matus UHLAR - fantomas, uh...@fantomas.sk ; http

Re: Is Bayes forgetting ?

2019-05-27 Thread hg user
Hi, the Linux user is the same. Bayes db is on Linux.

Re: Is Bayes forgetting ?

2019-05-27 Thread Matus UHLAR - fantomas
On 27.05.19 12:26, hg user wrote: I'm trying to understand what it is happening in my spamassassin 3.4.1 bayes system. I make it learn a new message as spam and it learns it correctly, so that the message is reported BAYES_50. Unfortunately, after a few minutes, and with no new messages learnt

Is Bayes forgetting ?

2019-05-27 Thread hg user
I'm trying to understand what it is happening in my spamassassin 3.4.1 bayes system. I make it learn a new message as spam and it learns it correctly, so that the message is reported BAYES_50. Unfortunately, after a few minutes, and with no new messages learnt (autolearn=no), the message

Re: Is the SA Bayes implementation mathematically sound?

2018-12-24 Thread Rick Macdougall
On 2018-12-24 12:39 p.m., Ian Zimmerman wrote: On 2018-12-23 17:02, Rick Macdougall wrote: I'm just going to jump in here and mention that I train my bayes in SA and in Thunderbird email client. Thunderbird catches 99%+ and SA catches under 60% with the same training data. Have you also

Re: Is the SA Bayes implementation mathematically sound?

2018-12-24 Thread Ian Zimmerman
On 2018-12-23 17:02, Rick Macdougall wrote: > I'm just going to jump in here and mention that I train my bayes in SA > and in Thunderbird email client. > > Thunderbird catches 99%+ and SA catches under 60% with the same > training data. Have you also compared the rates of

Re: Is the SA Bayes implementation mathematically sound?

2018-12-23 Thread Rick Macdougall
Hi, I'm just going to jump in here and mention that I train my bayes in SA and in Thunderbird email client. Thunderbird catches 99%+ and SA catches under 60% with the same training data. Take that as you will because, of course, there is mail that Thunderbird never sees due to SA

Re: Is the SA Bayes implementation mathematically sound?

2018-12-23 Thread Bill Cole
On 22 Dec 2018, at 18:39, Damian wrote: Hi all, is there someone who has a good grasp around the mathematics of Bayes learning with respect to SpamAssassin? Justin Mason would be the best person to discuss this. I do not know if he still reads this list. I assume that training a fresh

Re: Is the SA Bayes implementation mathematically sound?

2018-12-23 Thread Damian
late the probability of a tiny mail being ham or spam via Bayes theorem, then there would be only one correct solution. If the professor wanted to reuse the exercise with different parameters for various exams, maybe he would tell his assistant to create a little software so that he could play arou

Re: Is the SA Bayes implementation mathematically sound?

2018-12-22 Thread RW
On Sun, 23 Dec 2018 00:39:02 +0100 Damian wrote: > Hi all, > > is there someone who has a good grasp around the mathematics of Bayes > learning with respect to SpamAssassin? > > I assume that training a fresh BayesStore with a set of spam and ham > samples is mathematically

Is the SA Bayes implementation mathematically sound?

2018-12-22 Thread Damian
Hi all, is there someone who has a good grasp around the mathematics of Bayes learning with respect to SpamAssassin? I assume that training a fresh BayesStore with a set of spam and ham samples is mathematically sound. What bothers me a little is the expiration logic. The purpose of expiration

Re: Bayes underperforming, HTML entities?

2018-12-07 Thread John Hardin
On Fri, 7 Dec 2018, Amir Caspi wrote: On Dec 6, 2018, at 12:14 PM, John Hardin wrote: Runaway backtracking that was killing masscheck for several people. Hrm, that is disconcerting. I'm not sure where any backtracking might be occurring... This sort of thing is risky, especially in a

Re: Bayes underperforming, HTML entities?

2018-12-07 Thread Amir Caspi
On Dec 6, 2018, at 12:14 PM, John Hardin wrote: > > Runaway backtracking that was killing masscheck for several people. Hrm, that is disconcerting. I'm not sure where any backtracking might be occurring... Can anyone help improve this suggested rule? rawbody AC_HTML_ENTITY_BONANZA_NEW

Re: Bayes underperforming, HTML entities?

2018-12-06 Thread John Hardin
On Tue, 4 Dec 2018, Amir Caspi wrote: On Dec 1, 2018, at 10:31 AM, John Hardin wrote: On Thu, 29 Nov 2018, Amir Caspi wrote: A) Could you sandbox the proposed rule change (AC_HTML_ENTITY_BONANZA_NEW) and see how it performs, including possible FPs? Done. Any preliminary results?

Re: Bayes underperforming, HTML entities?

2018-12-04 Thread John Hardin
On Tue, 4 Dec 2018, Amir Caspi wrote: On Dec 1, 2018, at 10:31 AM, John Hardin wrote: On Thu, 29 Nov 2018, Amir Caspi wrote: A) Could you sandbox the proposed rule change (AC_HTML_ENTITY_BONANZA_NEW) and see how it performs, including possible FPs? Done. Any preliminary results?

Re: Bayes underperforming, HTML entities?

2018-12-04 Thread Amir Caspi
lar" spaces by the HTML interpreter prior to body rules? Or do they get turned into non-breaking space characters which are different than regular spaces? Like all the ZW stuff, this seems like it should get "normalized" so it can be available both in raw and normal form for Baye

Re: Bayes underperforming, HTML entities?

2018-12-01 Thread John Hardin
On Thu, 29 Nov 2018, John Hardin wrote: On Thu, 29 Nov 2018, Amir Caspi wrote: On Nov 29, 2018, at 3:27 PM, John Hardin wrote: I'll see whether those can be incorporated into the existing UNICODE_OBFU_ZW rule (which of course will no longer actually be UNICODE :) ) Great. Maybe rename

Re: Bayes underperforming, HTML entities?

2018-11-30 Thread RW
On Fri, 30 Nov 2018 15:49:31 -0700 Amir Caspi wrote: > > It make it harder to write rules detecting these tricks, but it may > > happen eventually. As far as Bayes is concerned, it would be a > > shame to lose the information. > > I'm not sure I see how Bayes can t

Re: Bayes underperforming, HTML entities?

2018-11-30 Thread Bill Cole
rules, and Bayes, function properly, no? Not if they are *looking for* those characters. But AFAIK we're only looking for those characters with rawbody rules, Not so. because it's really hard to search for them in regular body rules... no? No. See the relevant rule cluster (all with 'ZW

Re: Bayes underperforming, HTML entities?

2018-11-30 Thread Amir Caspi
On Nov 30, 2018, at 7:00 AM, Bill Cole wrote: > >> Since HTML is already getting rendered to text, then perhaps the conversion >> code should strip (literally, just delete) any zero-width characters during >> this conversion? That should make normal body rules

Re: Bayes underperforming, HTML entities?

2018-11-30 Thread RW
on my confusion. Since HTML is already > getting rendered to text, then perhaps the conversion code should > strip (literally, just delete) any zero-width characters during this > conversion? That should make normal body rules, and Bayes, function > properly, no? > > Is there a

<    1   2   3   4   5   6   7   8   9   10   >