*Hello everyone,*
*I was thinking about the latest discussions on the topic. During our last
Cassandra Contributor meeting we agreed it would be good to cover in our
agenda discussions/brainstorming on different topics. So I would like to
suggest to organize a brainstorming session on CASSANDRA-13701 between all
people who are interested on the topic. I believe the speed of our CI runs
is of common interest. What do you think? Does this sound feasible? Who is
in?*
*Best regards,*
*Ekaterina*

On Sat, 22 Aug 2020 at 9:21, Joshua McKenzie <jmcken...@apache.org> wrote:

> I think what Jordan is exploring, and I agree on, is that we need clear
>
> next steps to help reduce the 75% ish increase in dtest runtime. For
>
> sponsored contributors using circle to run the entire suites, throwing more
>
> money at the problem through parallelization isn't a long-term solution.
>
>
>
> On Sat, Aug 22, 2020 at 8:40 AM Jeremy Hanna <jeremy.hanna1...@gmail.com>
>
> wrote:
>
>
>
> > I know the dtests take a long time and this will make them longer. As a
>
> > counter point most people run individual dtests locally and the full set
> on
>
> > dedicated test infrastructure. For the dedicated test infrastructure Mick
>
> > also improved the wall clock runtime when parallelizing the dtests on
>
> > https://issues.apache.org/jira/browse/CASSANDRA-16006.
>
> >
>
> > Even with the longer dtest full runtime, I firmly believe that for the
>
> > sake of new users and how hard it is to change num_tokens once data is
>
> > written, this change to the default of num_tokens is long overdue.
> Another
>
> > hidden benefit of this change is that the dtests will now run bootstraps
>
> > the way operators should run them in practice with the new defaults. That
>
> > will make the more common default case much more tested and hopefully
> catch
>
> > regressions in that execution path faster.
>
> >
>
> > So while it is not a trivial change in full dtest runtime, the benefits
> to
>
> > the community and project are also not trivial. I’m really grateful to
> all
>
> > who have put in effort to make this a reality and know that new users in
>
> > 4.0 will benefit from these improved defaults.
>
> >
>
> > In other words my non binding vote is to merge this and look to improve
>
> > execution time separately with that effort not being as urgent for the
>
> > reasons stated above.
>
> >
>
> > Jeremy
>
> >
>
> > > On Aug 20, 2020, at 2:49 AM, Mick Semb Wever <m...@apache.org> wrote:
>
> > >
>
> > > It was agreed¹ that 4.0 should have the new configuration defaults of
>
> > >  num_tokens: 16
>
> > >  allocate_tokens_for_local_replication_factor: 3
>
> > >
>
> > > 13701's patches: against cassandra, cassandra-builds, cassandra-dtest,
>
> > ccm;
>
> > > are reviewed, tested, and ready to commit. But the ccm and dtest
> patches
>
> > > required ccm having to now start nodes sequentially, and adding some
>
> > longer
>
> > > timeout values in the dtests.
>
> > >
>
> > > The consequence of this is CI runs now take longer. ci-cassandra.a.o's
>
> > > dtests take ~30% longer, and circleci's dtests (with vnodes) have gone
>
> > from
>
> > > ~22 to ~43 minutes. The general opinion (on slack²) is to commit, and
>
> > work
>
> > > on improving ccm and dtest startup times in a subsequent ticket.
>
> > >
>
> > > 13701 was intended to be committed before the first beta release
> because
>
> > of
>
> > > its user-facing changes. But these numbers are significant enough it
>
> > makes
>
> > > sense to touch base with dev@
>
> > >
>
> > > Does anyone (strongly) object to the "commit + follow up ticket"
>
> > approach?
>
> > >
>
> > > regards,
>
> > > Mick
>
> > >
>
> > >
>
> > > ¹ –
>
> > >
>
> >
> https://lists.apache.org/thread.html/ra829084fcf344e9e96fa5c61cb31e909c8629091993471594b65ea89%40%3Cdev.cassandra.apache.org%3E
>
> > > ² – https://the-asf.slack.com/archives/CK23JSY2K/p1597747395032600 and
>
> > >
>
> >
> https://the-asf.slack.com/archives/CK23JSY2K/p1597849774078200?thread_ts=1597762085.048300&cid=CK23JSY2K
>
> >
>
>

Reply via email to