*Hello everyone,* *I was thinking about the latest discussions on the topic. During our last Cassandra Contributor meeting we agreed it would be good to cover in our agenda discussions/brainstorming on different topics. So I would like to suggest to organize a brainstorming session on CASSANDRA-13701 between all people who are interested on the topic. I believe the speed of our CI runs is of common interest. What do you think? Does this sound feasible? Who is in?* *Best regards,* *Ekaterina*
On Sat, 22 Aug 2020 at 9:21, Joshua McKenzie <jmcken...@apache.org> wrote: > I think what Jordan is exploring, and I agree on, is that we need clear > > next steps to help reduce the 75% ish increase in dtest runtime. For > > sponsored contributors using circle to run the entire suites, throwing more > > money at the problem through parallelization isn't a long-term solution. > > > > On Sat, Aug 22, 2020 at 8:40 AM Jeremy Hanna <jeremy.hanna1...@gmail.com> > > wrote: > > > > > I know the dtests take a long time and this will make them longer. As a > > > counter point most people run individual dtests locally and the full set > on > > > dedicated test infrastructure. For the dedicated test infrastructure Mick > > > also improved the wall clock runtime when parallelizing the dtests on > > > https://issues.apache.org/jira/browse/CASSANDRA-16006. > > > > > > Even with the longer dtest full runtime, I firmly believe that for the > > > sake of new users and how hard it is to change num_tokens once data is > > > written, this change to the default of num_tokens is long overdue. > Another > > > hidden benefit of this change is that the dtests will now run bootstraps > > > the way operators should run them in practice with the new defaults. That > > > will make the more common default case much more tested and hopefully > catch > > > regressions in that execution path faster. > > > > > > So while it is not a trivial change in full dtest runtime, the benefits > to > > > the community and project are also not trivial. I’m really grateful to > all > > > who have put in effort to make this a reality and know that new users in > > > 4.0 will benefit from these improved defaults. > > > > > > In other words my non binding vote is to merge this and look to improve > > > execution time separately with that effort not being as urgent for the > > > reasons stated above. > > > > > > Jeremy > > > > > > > On Aug 20, 2020, at 2:49 AM, Mick Semb Wever <m...@apache.org> wrote: > > > > > > > > It was agreed¹ that 4.0 should have the new configuration defaults of > > > > num_tokens: 16 > > > > allocate_tokens_for_local_replication_factor: 3 > > > > > > > > 13701's patches: against cassandra, cassandra-builds, cassandra-dtest, > > > ccm; > > > > are reviewed, tested, and ready to commit. But the ccm and dtest > patches > > > > required ccm having to now start nodes sequentially, and adding some > > > longer > > > > timeout values in the dtests. > > > > > > > > The consequence of this is CI runs now take longer. ci-cassandra.a.o's > > > > dtests take ~30% longer, and circleci's dtests (with vnodes) have gone > > > from > > > > ~22 to ~43 minutes. The general opinion (on slack²) is to commit, and > > > work > > > > on improving ccm and dtest startup times in a subsequent ticket. > > > > > > > > 13701 was intended to be committed before the first beta release > because > > > of > > > > its user-facing changes. But these numbers are significant enough it > > > makes > > > > sense to touch base with dev@ > > > > > > > > Does anyone (strongly) object to the "commit + follow up ticket" > > > approach? > > > > > > > > regards, > > > > Mick > > > > > > > > > > > > ¹ – > > > > > > > > https://lists.apache.org/thread.html/ra829084fcf344e9e96fa5c61cb31e909c8629091993471594b65ea89%40%3Cdev.cassandra.apache.org%3E > > > > ² – https://the-asf.slack.com/archives/CK23JSY2K/p1597747395032600 and > > > > > > > > https://the-asf.slack.com/archives/CK23JSY2K/p1597849774078200?thread_ts=1597762085.048300&cid=CK23JSY2K > > > > >