hg user skrev den 2020-12-09 08:57:
I believe that a SA plugin (like bayes) is able to know the envelope
MAIL FROM and RCPT TO values... is it correct? If it is possible we
"just" have to modify the bayes plugin
provide this patch first and ask later :=)
bayes does not fokus o
user identifier.
> Fair warning - if you go full per-user on a large system, this will MASSIVELY
> balloon the size of your Bayes database, and most users will idle below the
> learning thresholds for quite a long time.
> Can you give an idea of the size calculation? I'm wantin
> a large system, this will MASSIVELY balloon the size of your Bayes database,
> and most users will idle below the learning thresholds for quite a long time.
> Can you give an idea of the size calculation? I'm wanting to do this, but I
> need to figure out how much space I need to
loon the size of your
>> Bayes database, and most users will idle below the learning thresholds
>> for quite a long time.
> Can you give an idea of the size calculation? I'm wanting to do this,
> but I need to figure out how much space I need to allocate per user!
That woul
I believe that a SA plugin (like bayes) is able to know the envelope MAIL
FROM and RCPT TO values... is it correct? If it is possible we "just" have
to modify the bayes plugin
On Tue, Dec 8, 2020 at 10:13 PM Benny Pedersen wrote:
> micah anderson skrev den 2020-12-08 21:54:
&g
micah anderson skrev den 2020-12-08 21:54:
Kris Deugau writes:
There will only be one database and set of tables, but one of the
fields
in each table is the user identifier. Fair warning - if you go full
per-user on a large system, this will MASSIVELY balloon the size of
your
Bayes
Kris Deugau writes:
> There will only be one database and set of tables, but one of the fields
> in each table is the user identifier. Fair warning - if you go full
> per-user on a large system, this will MASSIVELY balloon the size of your
> Bayes database, and most users wil
Benoit Panizzon wrote:
Hi
This may help
<http://svn.apache.org/repos/asf/spamassassin/branches/duncf_masses/sql/README.bayes>
I sort of have the same issue. Unfortunately that does not help, it
merely explains how to store bayes data in a database. But there is
still only one '
On 08 Dec 2020, at 08:36, Benoit Panizzon wrote:
> Adding the list back to CC as I believe this is an interesting topic
> many have pondered over.
Forgot to fix the reply to on this list for some reason. Fixed now.
> Yes, I see that is states 'per user' but I still don't see, how that
Hi
Adding the list back to CC as I believe this is an interesting topic
many have pondered over.
Yes, I see that is states 'per user' but I still don't see, how that
'bayes user' is being set on a per recipient base.
On the email platform there is ONE config file for spamassassin. So if I
set
Hi
> This may help
>
> <http://svn.apache.org/repos/asf/spamassassin/branches/duncf_masses/sql/README.bayes>
I sort of have the same issue. Unfortunately that does not help, it
merely explains how to store bayes data in a database. But there is
still only one 'global' databa
On 07 Dec 2020, at 13:56, micah anderson wrote:
> A per-user setup would let each user do their own thing, but I don't see
> how I can do that because our system doesn't have individual system
> users and I don't see that there are options in the bayes sql
> configuration or per
Hi all,
I've got a site-wide bayes mysql setup. It keeps getting poisoned
quickly, because the user patterns are far too divergent from each
other. One person's spam is another person's ham, nobody is happy.
A per-user setup would let each user do their own thing, but I don't see
how I can do
Alex skrev den 2020-11-30 17:33:
2020-11-30 10:11:46 2772099 [Note] InnoDB: *** (2) WAITING FOR THIS
LOCK TO BE GRANTED:
have you tryed do the bayes with AriaDB ?
note i am not expert at all
Hi,
I recently set up a central database server to store bayes data for a
handful of mail relays to query for bayes info. I've done this in the
past and don't recall there being a problem with read locks, but hoped
someone could explain why I'm now seeing errors/warnings like this
when using sa
On 2020-05-28 15:32, Bert Van de Poel wrote:
Almost all of the email we process are forwarders. It doesn't really
make sense for us to do a non-global bayes db. The large majority of
email we process is also for a uniform group: student organizations at
our local university.
On 28.05.20 21:05
Oh, I had misunderstood you, Matus. My bad! I thought you meant we
should use a separate bayes db for every mailbox user, but now I
understand you were referring to the amavis user which indeed runs
everything.
I just moved the existing bayes db (after stopping amavis of course
On 2020-05-28 15:32, Bert Van de Poel wrote:
Almost all of the email we process are forwarders. It doesn't really
make sense for us to do a non-global bayes db. The large majority of
email we process is also for a uniform group: student organizations at
our local university.
does not matter
On 2020-05-28 15:22, Matus UHLAR - fantomas wrote:
On 28.05.20 13:38, Bert Van de Poel wrote:
We're using a global bayes_path defined in local.cf:
This is your problem imho.
if you use amavis, you need no bayes database, but amavis users',
i guess in /var/lib/amavis/.spamassassin
On 2020-05-28 10:18, Matus UHLAR - fantomas wrote:
I wonder where did these files come from.
did you sety bayes_path in /etc/spamassassin/ ?
setup userprefs file for amavisd, in this file make sure bayes data keep
in amavisd user, not the spamassasin user where there is no write access
On 28.05.20 15:32, Bert Van de Poel wrote:
Almost all of the email we process are forwarders. It doesn't really
make sense for us to do a non-global bayes db. The large majority of
email we process is also for a uniform group: student organizations at
our local university.
you have
Almost all of the email we process are forwarders. It doesn't really
make sense for us to do a non-global bayes db. The large majority of
email we process is also for a uniform group: student organizations at
our local university.
On 28/05/2020 15:22, Matus UHLAR - fantomas wrote
On 28.05.20 13:38, Bert Van de Poel wrote:
We're using a global bayes_path defined in local.cf:
This is your problem imho.
if you use amavis, you need no bayes database, but amavis users',
i guess in /var/lib/amavis/.spamassassin/
On 28/05/2020 10:18, Matus UHLAR - fantomas wrote
We're using a global bayes_path defined in local.cf:
use_bayes 1
use_bayes_rules 1
bayes_auto_learn 1
bayes_expiry_max_db_size 150
bayes_path /var/lib/spamassassin/bayes_db/bayes
bayes_file_mode 0775
bayes_ignore_to spam-analy...@ulyssis.org
bayes_ignore_from spam-analy...@ulyssis.org
of spam and ham
we've received, then enabling autolearn to further train the set. As
Spamassassin runs inside Amavis, the Bayes database files are owned by
the amavis user. This setup works fine, and results for Bayes are
great and growing in accuracy by autolearning.
What was somewhat
Plugin initialization+journal sync would make a lot of sense.
What would be the cleanest solution in that case? It's quite annoying to
receive the same error mail every day. Should we use --cnf to disable
the bayes plugin, or is there a more elegant solution? Should we file a
bug about
On Mon, 25 May 2020 23:34:27 +0200
Bert Van de Poel wrote:
> My question therefore specifically is: what exactly does sa-compile
> do to the bayes database files?
I don't know for sure, but it's probably just a side-effect of
initializing plugins. Possibly it's trying to p
of spam and ham
we've received, then enabling autolearn to further train the set. As
Spamassassin runs inside Amavis, the Bayes database files are owned by
the amavis user. This setup works fine, and results for Bayes are great
and growing in accuracy by autolearning.
What was somewhat confusing
ites through the journal they are
> > lockless.
>
> So, auto-training may be problematic w/r/t locking as well. I presume
> that window is coded to be as small as possible.
I wasn't referring auto-training, that's done at the end under locking,
and picking-up count updates
On Friday, February 14, 2020, 7:46:18 PM GMT+1, John Hardin
wrote:
>> I was looking at it in a bit more detail and it looks like there isn't
>> a reader-writer lock, just write locks for the toks and seen
>> files. As scans defer their writes through the journal they are
>> lockless.
On Fri, 14 Feb 2020, RW wrote:
On Fri, 14 Feb 2020 14:38:05 + (UTC)
Pedro David Marco wrote:
On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW
wrote: That would defeat the object of
having a journal file.>>Even if you are right, it doesn't really
explain anything because it>applies to
On Fri, 14 Feb 2020 14:38:05 + (UTC)
Pedro David Marco wrote:
>
>
> >On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW
> > wrote: That would defeat the object of
> >having a journal file.>>Even if you are right, it doesn't really
> >explain anything because it>applies to everyone using
>On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW
> wrote:
>That would defeat the object of having a journal file.>>Even if you are right,
>it doesn't really explain anything because it>applies to everyone using
>BDB/DBM/SDBM. >>IIWY I'd be looking at what's different for you.
I basically
On Thu, 13 Feb 2020 15:29:46 + (UTC)
Pedro David Marco wrote:
>
>>On 13.02.20 12:30, RW wrote:
> >>Bayes doesn't write on scans (unless it does an opportunistic sync
> >>or
> >expiry):
> >
> >doesn't it record token access times to journa
On Thu, 13 Feb 2020 15:05:57 +0100
Matus UHLAR - fantomas wrote:
> >On Wed, 12 Feb 2020 18:03:01 + (UTC)
> >Pedro David Marco wrote:
> >> i am getting errors from Byes because it is not able to lock Bayes
> >> files... Error log is:
> >> bayes
>On 13.02.20 12:30, RW wrote:
>>Bayes doesn't write on scans (unless it does an opportunistic sync or
>expiry):
>
>doesn't it record token access times to journal?
i think SA always does an EXclusive lock despite the parameters, probably
because of that, Fantomas
I
On Wed, 12 Feb 2020 18:03:01 + (UTC)
Pedro David Marco wrote:
i am getting errors from Byes because it is not able to lock Bayes
files... Error log is:
bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_*
R/W: lock failed: Interrupted system call
SA tries to lock bayes
On Wed, 12 Feb 2020 18:03:01 + (UTC)
Pedro David Marco wrote:
> Hi..
> i am getting errors from Byes because it is not able to lock Bayes
> files... Error log is:
> bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_*
> R/W: lock failed: Interrupted system c
should help too.
and I forgot to add:
storing in redis makes only sense for shared BAYES database.
--
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu
On 12.02.20 19:02, Pedro David Marco wrote:
Fully aligned with my suspicions...
I love Redis... i think someone posted sometime ago a performance table
showing local / sql / redis ... i will look for it...
but it would not surprise me if remote Redis is even faster than local DB...
last
On 2020-02-13 04:11, John Hardin wrote:
On Wed, 12 Feb 2020, Pedro David Marco wrote:
Hi..
i am getting errors from Byes because it is not able to lock Bayes
files...
Error log is:
bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_*
R/W: lock failed: Interrupted system
On Wed, 12 Feb 2020, Pedro David Marco wrote:
Hi..
i am getting errors from Byes because it is not able to lock Bayes files...
Error log is:
bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock
failed: Interrupted system call
SA tries to lock bayes files always
Kevin A. McGrail skrev den 2020-02-12 23:14:
I don't know for sure but Alex Broens aka axb is a master at Bayes
with Redis. Definitely look for his postings.
lets see sqlite with DBI: supported in spamassassin
i am happy with postgresql
I don't know for sure but Alex Broens aka axb is a master at Bayes with
Redis. Definitely look for his postings.
--
Kevin A. McGrail
Member, Apache Software Foundation
Chair Emeritus Apache SpamAssassin Project
https://www.linkedin.com/in/kmcgrail - 703.798.0171
On Wed, Feb 12, 2020 at 2:02 PM
.
On Wednesday, February 12, 2020, 7:32:42 PM GMT+1, Matus UHLAR - fantomas
wrote:
On 12.02.20 18:03, Pedro David Marco wrote:
>i am getting errors from Byes because it is not able to lock Bayes files...
>Error log is:
> bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_*
On 12.02.20 18:03, Pedro David Marco wrote:
i am getting errors from Byes because it is not able to lock Bayes files...
Error log is:
bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock
failed: Interrupted system call
SA tries to lock bayes files always
Hi..
i am getting errors from Byes because it is not able to lock Bayes files...
Error log is:
bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock
failed: Interrupted system call
SA tries to lock bayes files always in "EXclusive mode", hence when a sa
Bill Cole skrev den 2020-01-25 21:01:
Answering that question requires a more carefully written and detailed
problem description.
sorry for that
Zitat von Benny Pedersen :
i came after using it this way for long time now that it could be
more optimized with bayes user id vars, currently it creates new ids
each time there is a new user, but it does not reuse old ids that is
not used anymore after sa-learn --username f...@example.org
am just unsure if it can be solved in fuglu or it
needs to be supported in spamassassin to have mail alias support ?
another inconvence is that if outbound spamassassin is used, its still
recipient bayes learned, with means its not usefull learned, i had
hoped it was keeped to belearned from
is that if outbound spamassassin is used, its still
recipient bayes learned, with means its not usefull learned, i had hoped
it was keeped to belearned from sender domain with is local, not learned
to remote recipieents
my question is how to solve it in fuglu if it can at all ?
On Sat, 18 Jan 2020 13:31:10 +0100
Benny Pedersen wrote:
> i came after using it this way for long time now that it could be
> more optimized with bayes user id vars, currently it creates new ids
> each time there is a new user, but it does not reuse old ids that is
> not used anym
i came after using it this way for long time now that it could be more
optimized with bayes user id vars, currently it creates new ids each
time there is a new user, but it does not reuse old ids that is not used
anymore after sa-learn --username f...@example.org, then that id is not
used
On 8 Jan 2020, at 7:47, Guido Goluke, MajorLabel wrote:
Will I need to move the Bayes DB to an 'SQL form' when I switch to
SQL-style user preferences?
No. The Bayes, user prefs, and TxRep (or AWL) databases each have their
own independent configurations
--
Bill Cole
b...@scconsult.com
My setup is that of only virtual e-mail users. The 'vmail' user handles
all internal stuff. Right now, I have a cronjob on the vmail user that
scans 'learn spam' folders on a couple of mailboxes so that the bayes DB
is built up on the vmail user and the learnt false negatives are
actually used
Can I bump this one to the top again? I had great bayes reports in
every email for 30 minutes. Then nothing for the last three days, even
after restarting SA. Is it possible that my bayes db got corrupted?
The sa-learn --dump magic looks ok as far as I can tell.
Thanks for any suggestions
On 27 Nov 2019, at 06:52, Anders Gustafsson wrote:
> 0.000 0 3184 0 non-token data: nspam
> 0.000 0 17298 0 non-token data: nham
Plenty of spam and ham learned
> 0.000 0 1553643652 0 non-token data: oldest atime
Oldest data is
On 27.11.19 15:52, Anders Gustafsson wrote:
pamir:~ # sa-learn --dump magic
0.000 0 3 0 non-token data: bayes db version
0.000 0 3184 0 non-token data: nspam
0.000 0 17298 0 non-token data: nham
0.000 0
Ie:
pamir:~ # sa-learn --dump magic
0.000 0 3 0 non-token data: bayes db version
0.000 0 3184 0 non-token data: nspam
0.000 0 17298 0 non-token data: nham
0.000 0 164549 0 non-token data: ntokens
This is getting stranger by the minute... After playing around and
verifying permissions and everything, I actually started getting a bayes
score item for each email. So I celebrated and went to dinner. Came
back a few hours later and checked the logs. Bayes consistently added a
score line
On 11/25/2019 3:02 PM, Mikael Syska wrote:
Try and run:
sa-learn --dump magic
Should give you some information like:
0.000 0 3 0 non-token data: bayes db version
0.000 0 493422 0 non-token data: nspam
0.000 0 3867414 0 non
On Mon, 25 Nov 2019 22:02:03 +0100
Mikael Syska wrote:
> Try and run:
>
> sa-learn --dump magic
That needs to be run as the spamd user (using su or otherwise) for db
files.
Try and run:
sa-learn --dump magic
Should give you some information like:
0.000 0 3 0 non-token data: bayes db version
0.000 0 493422 0 non-token data: nspam
0.000 0 3867414 0 non-token data: nham
0.000 0
On Sun, 24 Nov 2019 12:35:16 -0600
Jerry Malcolm wrote:
> Thanks again to everyone who helped me get everything up and running
> over the past couple of days.
>
> Now that I have SA finding my bayes database, I'm curious about bayes
> reporting. My bayes db is not new. I migra
Thanks again to everyone who helped me get everything up and running
over the past couple of days.
Now that I have SA finding my bayes database, I'm curious about bayes
reporting. My bayes db is not new. I migrated my previous bayes db
from my old installation. And I've been training
On Fri, 22 Nov 2019 15:15:02 +0100
Philipp Ewald wrote:
> Hi folks,
>
> sorry for asking again but Bayes is a blackbox for me :(
>
> I have some question about bayes math. How does bayes/spamassassin
> calculate this values?
>
> i'm wonder about : auto-learn?*
Hi folks,
sorry for asking again but Bayes is a blackbox for me :(
I have some question about bayes math. How does bayes/spamassassin
calculate this values?
i'm wonder about : auto-learn?* and "score so far=8.601)
dbg: rules: running head tests; score so far=8.601
[...]
dbg: rules: ru
On Tue, 22 Oct 2019 12:21:45 +0200
Philipp Ewald wrote:
> Hi folks,
>
> at this point i split all my SPAM mail to get the attachment to
> create a hash table. (but this is not my point)
>
> Its also possible to split my SPAM into html/text, plain/text and
> headers to.
> Debian package:
Hi folks,
at this point i split all my SPAM mail to get the attachment to create a
hash table. (but this is not my point)
Its also possible to split my SPAM into html/text, plain/text and
headers to.
Debian package: ripmime
Now i ask myself:
If i learn spamassassin with my mails should i
On Fri, 28 Jun 2019, 07:42 Amir Caspi, wrote:
> Hi all,
>
> I don't suppose anyone has a neural-net-based SA Machine Learning plugin
> or external program, to complement or replace Bayes? There are a number of
> fairly compact Python ML packages that would greatly ease this
> Of course, Gmail and the other big providers have their own ML solutions that
> seem to be pretty good, though they have an enormous user base and
> near-infinite resources...
I would argue, in contrary, that Gmail performs rather poorly, I have at
least one FP a day and that is a big no no.
Hi all,
I don't suppose anyone has a neural-net-based SA Machine Learning plugin or
external program, to complement or replace Bayes? There are a number of fairly
compact Python ML packages that would greatly ease this task nowadays, like
TensorFlow. It looks like rspamd has a neural net
the same as what?
>Bayes db is on Linux.
seems I wasn't clear at my question:
How do you use spamassassin? milter, amavis, procmail filter, postfix
filter ... ?
--
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to
ote:
>> On 27.05.19 12:51, hg user wrote:
>> >the Linux user is the same.
>> the same as what?
>> >Bayes db is on Linux.
>> seems I wasn't clear at my question:
>> How do you use spamassassin? milter, amavis, procmail filter, postfix
>>
On 27.05.19 15:15, hg user wrote:
I think, well... I suppose to be doing everything according to zimbra
requirements.
does zimbra document how to run spamassassin test?
I'm trying to undersand if it is possible that the bayes plugin reports
different results within a few minutes
ncluded
> 0.000 0 3 0 non-token data: bayes db version
> 0.000 01042011 0 non-token data: nspam
> 0.000 0 66472 0 non-token data: nham
> 0.000 0 663479 0 non-token data: ntokens
> 0.000
I think, well... I suppose to be doing everything according to zimbra
requirements.
I'm trying to undersand if it is possible that the bayes plugin reports
different results within a few minutes and with no changes to the bayes
db...
Here are the commands, so that you may tell me if some
n Mon, May 27, 2019 at 1:18 PM Matus UHLAR - fantomas
wrote:
On 27.05.19 12:51, hg user wrote:
>the Linux user is the same.
the same as what?
>Bayes db is on Linux.
seems I wasn't clear at my question:
How do you use spamassassin? milter, amavis, procmail filter, postfix
filter ... ?
.
On Mon, May 27, 2019 at 1:18 PM Matus UHLAR - fantomas
wrote:
> On 27.05.19 12:51, hg user wrote:
> >the Linux user is the same.
>
> the same as what?
>
> >Bayes db is on Linux.
>
> seems I wasn't clear at my question:
> How do you use spamassassin? milter,
On 27.05.19 12:51, hg user wrote:
the Linux user is the same.
the same as what?
Bayes db is on Linux.
seems I wasn't clear at my question:
How do you use spamassassin? milter, amavis, procmail filter, postfix
filter ... ?
--
Matus UHLAR - fantomas, uh...@fantomas.sk ; http
Hi,
the Linux user is the same.
Bayes db is on Linux.
On 27.05.19 12:26, hg user wrote:
I'm trying to understand what it is happening in my spamassassin 3.4.1
bayes system.
I make it learn a new message as spam and it learns it correctly, so that
the message is reported BAYES_50.
Unfortunately, after a few minutes, and with no new messages learnt
I'm trying to understand what it is happening in my spamassassin 3.4.1
bayes system.
I make it learn a new message as spam and it learns it correctly, so that
the message is reported BAYES_50.
Unfortunately, after a few minutes, and with no new messages learnt
(autolearn=no), the message
On 2018-12-24 12:39 p.m., Ian Zimmerman wrote:
On 2018-12-23 17:02, Rick Macdougall wrote:
I'm just going to jump in here and mention that I train my bayes in SA
and in Thunderbird email client.
Thunderbird catches 99%+ and SA catches under 60% with the same
training data.
Have you also
On 2018-12-23 17:02, Rick Macdougall wrote:
> I'm just going to jump in here and mention that I train my bayes in SA
> and in Thunderbird email client.
>
> Thunderbird catches 99%+ and SA catches under 60% with the same
> training data.
Have you also compared the rates of
Hi,
I'm just going to jump in here and mention that I train my bayes in SA
and in Thunderbird email client.
Thunderbird catches 99%+ and SA catches under 60% with the same training
data.
Take that as you will because, of course, there is mail that Thunderbird
never sees due to SA
On 22 Dec 2018, at 18:39, Damian wrote:
Hi all,
is there someone who has a good grasp around the mathematics of Bayes
learning with respect to SpamAssassin?
Justin Mason would be the best person to discuss this. I do not know if
he still reads this list.
I assume that training a fresh
late the probability of a tiny mail being ham or spam
via Bayes theorem, then there would be only one correct solution.
If the professor wanted to reuse the exercise with different parameters
for various exams, maybe he would tell his assistant to create a little
software so that he could play arou
On Sun, 23 Dec 2018 00:39:02 +0100
Damian wrote:
> Hi all,
>
> is there someone who has a good grasp around the mathematics of Bayes
> learning with respect to SpamAssassin?
>
> I assume that training a fresh BayesStore with a set of spam and ham
> samples is mathematically
Hi all,
is there someone who has a good grasp around the mathematics of Bayes
learning with respect to SpamAssassin?
I assume that training a fresh BayesStore with a set of spam and ham
samples is mathematically sound. What bothers me a little is the
expiration logic.
The purpose of expiration
On Fri, 7 Dec 2018, Amir Caspi wrote:
On Dec 6, 2018, at 12:14 PM, John Hardin wrote:
Runaway backtracking that was killing masscheck for several people.
Hrm, that is disconcerting. I'm not sure where any backtracking might be
occurring...
This sort of thing is risky, especially in a
On Dec 6, 2018, at 12:14 PM, John Hardin wrote:
>
> Runaway backtracking that was killing masscheck for several people.
Hrm, that is disconcerting. I'm not sure where any backtracking might be
occurring...
Can anyone help improve this suggested rule?
rawbody AC_HTML_ENTITY_BONANZA_NEW
On Tue, 4 Dec 2018, Amir Caspi wrote:
On Dec 1, 2018, at 10:31 AM, John Hardin wrote:
On Thu, 29 Nov 2018, Amir Caspi wrote:
A) Could you sandbox the proposed rule change (AC_HTML_ENTITY_BONANZA_NEW) and
see how it performs, including possible FPs?
Done.
Any preliminary results?
On Tue, 4 Dec 2018, Amir Caspi wrote:
On Dec 1, 2018, at 10:31 AM, John Hardin wrote:
On Thu, 29 Nov 2018, Amir Caspi wrote:
A) Could you sandbox the proposed rule change (AC_HTML_ENTITY_BONANZA_NEW) and
see how it performs, including possible FPs?
Done.
Any preliminary results?
lar" spaces by the HTML
interpreter prior to body rules? Or do they get turned into non-breaking space
characters which are different than regular spaces? Like all the ZW stuff,
this seems like it should get "normalized" so it can be available both in raw
and normal form for Baye
On Thu, 29 Nov 2018, John Hardin wrote:
On Thu, 29 Nov 2018, Amir Caspi wrote:
On Nov 29, 2018, at 3:27 PM, John Hardin wrote:
I'll see whether those can be incorporated into the existing
UNICODE_OBFU_ZW rule (which of course will no longer actually be UNICODE
:) )
Great. Maybe rename
On Fri, 30 Nov 2018 15:49:31 -0700
Amir Caspi wrote:
> > It make it harder to write rules detecting these tricks, but it may
> > happen eventually. As far as Bayes is concerned, it would be a
> > shame to lose the information.
>
> I'm not sure I see how Bayes can t
rules, and Bayes, function properly, no?
Not if they are *looking for* those characters.
But AFAIK we're only looking for those characters with rawbody rules,
Not so.
because it's really hard to search for them in regular body rules...
no?
No.
See the relevant rule cluster (all with 'ZW
On Nov 30, 2018, at 7:00 AM, Bill Cole
wrote:
>
>> Since HTML is already getting rendered to text, then perhaps the conversion
>> code should strip (literally, just delete) any zero-width characters during
>> this conversion? That should make normal body rules
on my confusion. Since HTML is already
> getting rendered to text, then perhaps the conversion code should
> strip (literally, just delete) any zero-width characters during this
> conversion? That should make normal body rules, and Bayes, function
> properly, no?
>
> Is there a
101 - 200 of 4555 matches
Mail list logo