> On 6 Sep 2019, at 10:58, Thomas Mortagne <thomas.morta...@xwiki.com> wrote:
> 
> On Fri, Sep 6, 2019 at 10:46 AM Vincent Massol <vinc...@massol.net> wrote:
>> 
>> 
>> 
>>> On 6 Sep 2019, at 10:42, Thomas Mortagne <thomas.morta...@xwiki.com> wrote:
>>> 
>>> And why do you guys think about raisin the history to 30 at least
>>> platform pipeline jobs?
>> 
>> We need to first check if we have enough disk space first, it’s going to 
>> consume a lot of it.
> 
> ci.xwiki.org currently use 393G (total, not just Jenkins) and have
> 1.3T available. I check a few history entries of xwiki-platform master
> and the scale seems to be about 70M (90% of which is the log file)
> with something like 10 failing tests so I think we are safe for a
> little while

I hope so because I remember that I had to remove twice the logs for jenkins 
over the summer. It’s a different problem but both use up disk space.

We need to fix the log issue BTW.

Thanks
-Vincent

> 
>> 
>> Also, we would need to monitor closely for perf issues and roll it back if 
>> it doesn’t go well.
> 
> Of course but that's easy.
> 
>> 
>> Thanks
>> -Vincent
>> 
>>> 
>>> On Fri, Sep 6, 2019 at 10:39 AM Vincent Massol <vinc...@massol.net> wrote:
>>>> 
>>>> 
>>>> 
>>>>> On 6 Sep 2019, at 10:35, Thomas Mortagne <thomas.morta...@xwiki.com> 
>>>>> wrote:
>>>>> 
>>>>> On Fri, Sep 6, 2019 at 10:32 AM Vincent Massol <vinc...@massol.net> wrote:
>>>>>> 
>>>>>> Hi Simon,
>>>>>> 
>>>>>>> On 6 Sep 2019, at 10:27, Simon Urli <simon.u...@xwiki.com> wrote:
>>>>>>> 
>>>>>>> Hi all,
>>>>>>> 
>>>>>>> On 05/09/2019 17:40, Simon Urli wrote:
>>>>>>>> On 05/09/2019 17:24, Thomas Mortagne wrote:
>>>>>>>>> On Thu, Sep 5, 2019 at 3:43 PM Simon Urli <simon.u...@xwiki.com> 
>>>>>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>> Hi everyone,
>>>>>>>>>> 
>>>>>>>>>> reopening this thread since I started to close some flicker issues as
>>>>>>>>>> part of BFD and got comments for those.
>>>>>>>>>> 
>>>>>>>>>> So the last mails on this threads suggested to close the flicker 
>>>>>>>>>> issues
>>>>>>>>>> if we didn't manage to reproduce them locally after a repeated tests,
>>>>>>>>>> and that we didn't see them after a while.
>>>>>>>>>> 
>>>>>>>>>> We didn't vote for those suggestion and I assumed a bit quick that I
>>>>>>>>>> could close some flicker issues that I personally don't remember 
>>>>>>>>>> about
>>>>>>>>>> on the CI after having tested them locally.
>>>>>>>>>> My point for doing that is the same as for the first mail I posted on
>>>>>>>>>> this thread: those flickers are old, and the code did change enough 
>>>>>>>>>> for
>>>>>>>>>> those to be fixed in a way or another.
>>>>>>>>> 
>>>>>>>>> Being old does not always means the code leading to those failures
>>>>>>>>> changed that much.
>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> Now I might be completely wrong, and the flicker to happen again, 
>>>>>>>>>> but I
>>>>>>>>>> don't think it's a problem since we can really easily open back the
>>>>>>>>>> issues if it's the case.
>>>>>>>>>> 
>>>>>>>>>> The other solution IMO is to indeed keep the issue open and in fact 
>>>>>>>>>> to
>>>>>>>>>> never really close them, because we just don't have time to 
>>>>>>>>>> investigate
>>>>>>>>>> each of them properly.
>>>>>>>>>> 
>>>>>>>>>> I really don't see any value of keeping things open and don't act on
>>>>>>>>>> them, that's why I suggest to close them after doing the checks we
>>>>>>>>>> suggested before:
>>>>>>>>>>  1. try to repeat locally the failure;
>>>>>>>>> 
>>>>>>>>> This is totally useless IMO unless you make sure that your computer is
>>>>>>>>> made super slow some way since that's the reason for most of the
>>>>>>>>> flickering tests.
>>>>>>>>> 
>>>>>>>>>>  2. check that we didn't encounter those flickers since last cycle.
>>>>>>>>> 
>>>>>>>>> This one is enough for me but the hard part is to knowing that.
>>>>>>>> Ok, so the proposal is now to check only the age since last time we 
>>>>>>>> saw them of the open flickers before closing them.
>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> So first question, do we all agree on that?
>>>>>>>>>> 
>>>>>>>>>> Then for the second check, Vincent suggested to add some tooling: it
>>>>>>>>>> will be best, but it takes time to do. So on the meantime, as Thomas
>>>>>>>>>> also suggested, we could add a check in the release plan to create or
>>>>>>>>>> update all jira issues that concerns flickers. It would allow us to 
>>>>>>>>>> keep
>>>>>>>>>> some information about the liveness of our flickers.
>>>>>>>>>> 
>>>>>>>>>> So second question, do you agree on that?
>>>>>>>>> 
>>>>>>>>> Depends what it exactly means. Have some dedicated jira field to
>>>>>>>>> indicate when you saw it last ? Comment that you just saw that test
>>>>>>>>> failing again ?
>>>>>>>> My suggestion was about a dedicated JIRA field if possible.
>>>>>>> 
>>>>>>> So, ok if I create a new custom field in JIRA for flickers, called 
>>>>>>> "Date of last failure for flicker”?
>>>>>> 
>>>>>> [snip]
>>>>>> 
>>>>>> I don’t see how it’ll help since it’ll never be up to date, and the old 
>>>>>> value will remain making us think it’s not been flickering for a long 
>>>>>> time.
>>>>> 
>>>>> In my mind the idea is not so much to use this field as a criteria to
>>>>> close an issue but as a criteria to not close it.
>>>> 
>>>> ok, as long as we don’t use it for closing, I’m fine :)
>>>> 
>>>> Thanks
>>>> -Vincent
>>>> 
>>>>> 
>>>>>> 
>>>>>> Thanks
>>>>>> -Vincent
>>>>>> 
>>>>> 
>>>>> 
>>>>> --
>>>>> Thomas Mortagne
>>>> 
>>> 
>>> 
>>> --
>>> Thomas Mortagne
>> 
> 
> 
> -- 
> Thomas Mortagne

Reply via email to