+1 I support removing the retries for the many reasons already mentioned.

- Michael

On Tue, Jul 12, 2022 at 12:35 AM Matteo Merli <mme...@apache.org> wrote:
>
> +1
>
> The retry that repeats the whole maven job is often hiding the real
> test failures that were in the 1st failed job.
> --
> Matteo Merli
> <mme...@apache.org>
>
> On Mon, Jul 11, 2022 at 9:06 PM Anon Hxy <anonhx...@gmail.com> wrote:
> >
> > Hi Boschi
> >
> > I support this plan and I think we need take some effort to make the Pulsar
> > CI more stable.
> >
> > Thanks,
> > Xiaoyu Hou
> >
> > Nicolò Boschi <boschi1...@gmail.com> 于2022年7月11日周一 22:14写道:
> >
> > > Hi all,
> > >
> > > I'd like to start a discussion about the stability of Pulsar CI.
> > >
> > > It is common that some tests suite in our CI times out. This is because
> > > when a test fails the entire suite is retried from the beginning (max 3
> > > times). (example:
> > > https://github.com/apache/pulsar/runs/7281063499?check_suite_focus=true)
> > >
> > > The command-line retries may sound helpful in making the CI green for a
> > > given pull but they actually hide test failures (that may be flakies or
> > > real issues!!).
> > >
> > > Another issue is that you can't easily see the failed test and most of the
> > > time the quickest solution is just to blindly restart the failed jobs. 
> > > This
> > > is not the correct behaviour and it will make the CI less stable over 
> > > time.
> > >
> > > The plan would be:
> > > - Remove the retries (see https://github.com/apache/pulsar/pull/16524)
> > > - Create issue for flaky tests
> > > - Fix them / move to quarantine
> > >
> > > WDYT?
> > >
> > > Thanks,
> > > Nicolò Boschi
> > >

Reply via email to