On 11/12/2017 09:22 AM, Merijn van den Kroonenberg wrote:
On 11/11/2017 11:09 AM, Merijn van den Kroonenberg wrote:
On 11/11/2017 07:45 AM, Merijn van den Kroonenberg wrote:
I saw you applied my patch for ruleqa.cgi and if I read the apache
conf
correctly, then the webserver serves pages directly from the checkout
/usr/local/spamassassin/automc/svn/masses/rule-qa/automc/ruleqa.cgi
Damn stupid me.
As I explained above the webserver serves directly from
/usr/local/spamassassin/automc/svn/masses/rule-qa/automc/ruleqa.cgi
Thats the *svn/masses* !
I am so glad you are helping and able to keep this stuff straight. I
worked on this stuff a lot back in the summer and have forgotten a lot
of what I did. :) I meant to have a cron job setup for the automc user
to do an "svn up ~/svn/masses" but it wasn't there until a few minutes
ago. I had patched the ruleqa.cgi and committed it but the "svn up
~/svn/masses" was needed. I just did this part so it's now active.Â
Sorry about that.
No problem, it hard, like you saw above, even tho I analyzed the
situation, I still managed to mess up my own conclusion the first time ;)
So the change should have gone live immediately if you applied the
patch
directly in svn/trunk/masses (and not in svn/masses).
Thats the wrong way around! Actually for the change to go live it should
be patched in
/usr/local/spamassassin/automc/svn/masses/rule-qa/automc/ruleqa.cgi
But thats in the readonly working copy. I just verified in the resync
account and indeed the ruleqa.cgi has not been patched so my change did
NOT go live.
It should be live now. I manually verified the file at that path.
http://ruleqa.spamassassin.org/?longdatelist=1
Yes I see, and the patch indeed fixes the problem with ruleqa. I think the
2-days-ago pages will now work as expected.
So far good news on my latest patch to the SVN REVISION detection to
find the majority REVISION instead of the latetest REVISION.
But why did last night (net masscheck) fail, it says not enough
contributors, but when you look at the corpus there seem to be enough
contributors? Or were some late?
The logic that was in place in March required a minimum of 10
contributors for ham and spam. I added a "force" arg to work with 8
each to help move things along. I thought we were easily getting 15
good contributors so a few days ago I removed the "force" option from
the cron entry. I noticed this about 5 hours ago and put the "force"
option back. I know, I know. I shouldn't be changing things like this
right now. :)
llanga does stand out as being off a bit. It seems to be a day behind
at least by the date stamp. Do I need to put in a temporary cleanup for
this masschecker to toss it out?
I don't think the problem is llanga itself, I suspect its some of our
scripting messing up.
See:
http://ruleqa.spamassassin.org/20171106-r1814390-n
It has:
20171106-r1814390-n (Viewing)
axb-coi-bulk axb-generic axb-ham-misc darxus ena-week0 ena-week1 ena-week2
ena-week3 giovanni jarif jbrooks llanga mmiroslaw-mails-ham
mmiroslaw-mails-spam thendrikx
and
20171107-r1814390-n
axb-coi-bulk axb-generic axb-ham-misc jarif llanga
So It has the same revision (1814390) masschecked to two consecutive days,
BUT all contributors in 20171107 are also present in 20171106 and in this
case its not only llanga.
I would think the tagged version in SVN that gets put in the staged
rsync directory would be exactly what is reported in the ruleqa.cgi but
the date must be tied to the local run timestamp. This might be
something else we need to fix after we get this going "good enough." I
would like to open bug issues to track this kind of problem once we get
things back to working how they did in March.
So what triggers them to masscheck the same revision again, the next day?
But they also masscheck the newer revision later that same day.
What triggers the masscheck? (I don't know anything about that part yet)
1. I think your ruleqa.cgi patch is helping so the rule promotion script
should work tomorrow.
2. The masscheck processing is supposed to be cron'd to run just after 9
AM UTC as closely as possible by each masschecker but some choose to run
it much later for different reasons like the cost of electricity or
spare computing capacity.