On Fri, Jul 21, 2017 at 5:30 PM, Mike Drob <md...@apache.org> wrote: > > One thing that is not clear to me from your summary and the linked document > is which release we can block on tests? Unit tests are in bad shape (but > getting better) and I recently started looking at ITs which also need some > care. > > Dunno. What you think boss? I'd say we can't release if tests are a mess. Should we block on RC (as in alpha, beta, then RC)? Or before then? i.e. no beta unless all tests (unit and IT) are passing.
For sure testing -- as in exercising new features, changed configs, and just straight longevity under stress -- is badly wanting. I should write up a matrix I suppose? Let me work on it. Thanks Mike, S > Mike > > On Fri, Jul 21, 2017 at 11:03 AM, Stack <st...@duboce.net> wrote: > > > Status update girls and boys! > > > > hbase-2.0.0-alpha1 went out June 22nd. > > > > alpha2 has been a bit slow to follow (holidays) though there has been > > steady progress closing out blockers and criticals by a bunch of you all. > > The plan is for a release in the first week or so of August. It should be > > fully up on hbase-thirdparty using updated (and relocated) versions of > > netty, guava, and protobuf as well as a default deploy that has > > master-carrying-no-regions. > > > > alpha3 will follow soon after and will focus on making sure our > user-facing > > APIs are clean (branch-1 compatible, no illicit removals/mods, and so on) > > and that basic upgrade 'works'. > > > > betas start in September? > > > > I've been keeping a rough general state here [1] (please update any > section > > that is lagging actuality) but for details on what blockers and criticals > > remain, see the JIRA 2.0 view [2]. Recent issue-gardening has brought 2.0 > > into better focus. Feel free to review and punt items you think can wait > > till 3.0 or 2.1. If you want to pull in more stuff, please ask first. > > > > Thanks, > > St.Ack > > > > 1. > > https://docs.google.com/document/d/1WCsVlnHjJeKUcl7wHwqb4z9iEu_ > > ktczrlKHK8N4SZzs/edit# > > 2. https://issues.apache.org/jira/projects/HBASE/versions/12327188 > > > > > > On Mon, May 15, 2017 at 4:05 PM, Stack <st...@duboce.net> wrote: > > > > > > > > On Mon, May 15, 2017 at 1:46 AM, 张铎(Duo Zhang) <palomino...@gmail.com> > > > wrote: > > > > > >> HBASE-18037 is a new blocker. I'm currently working on it, will be > > >> finished > > >> soon I think. > > >> > > >> I made it a blocker then and added it to our hbase2 release doc [1] > list > > > as a blocker. > > > > > > Thanks, > > > St.Ack > > > > > > 1. https://docs.google.com/document/d/1WCsVlnHjJeKUcl7wHwqb4z9iEu_ > > > ktczrlKHK8N4SZzs/edit# > > > > > > > > >> 2017-05-15 14:12 GMT+08:00 Stack <st...@duboce.net>: > > >> > > >> > > >> > A month on. Status. > > >> > > > >> > I've been working on the HBASE-14614 branch cluster testing. After a > > >> load > > >> > of fixing, the branch passes smaller test runs (an hour or so of > ITBLL > > >> up > > >> > to 2B rows w/ killing monkeys). When I go larger, to a scale I've > not > > >> done > > >> > in a while, I start to run into other interesting issues -- some of > > >> which > > >> > are related to AMv2 (I'm fixing), but others are not (100G WALs that > > >> take > > >> > ten minutes to split makes for interesting cascades when monkeys > kill > > >> > inside the ten minutes...). I intend to keep on with this larger > scale > > >> > testing since it is uncovering good stuff (especially when HDFS is > dog > > >> slow > > >> > because of background replications) but my thinking is that I should > > be > > >> > large scale testing branch-2, not just HBASE-14614. I think > > HBASE-14614, > > >> > the new AMv2, is good enough to merge to master these times. Given > it > > is > > >> > the last blocker, once in, I'll cut the hbase2 branch. > > >> > > > >> > I'll start up a 'Merge HBASE-14614' DISCUSSION thread in the next > day > > >> or so > > >> > (I need to fix some unit tests...). > > >> > > > >> > The AMv2 doc is still a work in progress but should give a gist on > > >> where we > > >> > are currently[1]. There is a bunch of todo still but seems > tractable; > > >> e.g. > > >> > rolling upgrade, finish doc., and we don't have an HBCK since it > needs > > >> to > > >> > be recast in light of how stuff now works but a redo on HBCK is > > >> premature > > >> > given we don't know failure types as yet (we just fix the problems > as > > >> they > > >> > come up). > > >> > > > >> > St.Ack > > >> > 1. > > >> > https://docs.google.com/document/d/1eVKa7FHdeoJ1- > > >> > 9o8yZcOTAQbv0u0bblBlCCzVSIn69g/edit# > > >> > > > >> > > > >> > > > >> > On Thu, Apr 13, 2017 at 1:43 PM, Stack <st...@duboce.net> wrote: > > >> > > > >> > > Some status: > > >> > > > > >> > > AMv2 (HBASE-14614) is near to passing all tests caveat my > disabling > > of > > >> > all > > >> > > to-do w/ fsck (fsck needs revamp) and tests that expect that they > > can > > >> > move > > >> > > hbase;meta off master (AMv2 enforces this constraint; it is > supposed > > >> to > > >> > be > > >> > > enforced on AMv1 but meta-on-master is incompletely realized in > AMv1 > > >> and > > >> > > AMv2). A few other tests have been disabled for various reasons. > See > > >> [1] > > >> > > for full list. > > >> > > > > >> > > There is a hefty list of TODOs still (Again see the messy doc [1]) > > but > > >> > the > > >> > > only 'blocker', IMO, is community confidence in AMv2. Currently, > > >> cluster > > >> > > tests with chaos fail (new form of 'stuck' regions). Takes time > > >> > > investigating. > > >> > > > > >> > > Will keep you all posted. > > >> > > St.Ack > > >> > > > > >> > > > > >> > > > > >> > > 1. https://docs.google.com/document/d/1eVKa7FHdeoJ1- > > >> > > 9o8yZcOTAQbv0u0bblBlCCzVSIn69g/edit#heading=h.92vclum0bvod > > >> > > > > >> > > > > >> > > > > >> > > On Fri, Mar 31, 2017 at 1:14 PM, Andrew Purtell < > > apurt...@apache.org> > > >> > > wrote: > > >> > > > > >> > >> +1 on branching (yay!) > > >> > >> > > >> > >> I have EC2 resources for running ITBLL etc. > > >> > >> > > >> > >> > > >> > >> On Thu, Mar 30, 2017 at 5:07 PM, Stack <st...@duboce.net> wrote: > > >> > >> > > >> > >> > Some notes on progress toward hbase2. > > >> > >> > > > >> > >> > Given that stability and performance are NOT emergent behaviors > > but > > >> > >> rather > > >> > >> > projects unto themselves, my thought is that we commit all that > > >> we've > > >> > >> > agreed as core for hbase2 (see [1]), branch, and then work on > > >> > >> stabilizing > > >> > >> > and perf rather than do stabilize, commit, and then branch. > What > > >> this > > >> > >> means > > >> > >> > in practice is that for features like Inmemory Compaction, we > > >> commit > > >> > it > > >> > >> > defaulted 'on' ("BASIC" mode) which is what we want in hbase2. > > >> Should > > >> > it > > >> > >> > prove problematic under test, we disable it before release. > > >> > >> > > > >> > >> > Are folks good w/ this mode? I ask because, in a few issues > there > > >> are > > >> > >> > requests for proof that a master feature is 'stable' before > > commit. > > >> > >> This is > > >> > >> > normally a healthy request only in master's case, it is hard to > > >> > >> demonstrate > > >> > >> > stability given its current state. > > >> > >> > > > >> > >> > Other outstanding issues such as decisions about whether master > > >> hosts > > >> > >> > system tables only (by default), I'm thinking, we can work out > > post > > >> > >> branch > > >> > >> > in alpha/betas before release. > > >> > >> > > > >> > >> > The awkward item is the long-pole Assignment Manager. This is > an > > >> > >> > all-or-nothing affair. Here we are switching in a new Master > > core. > > >> > >> While I > > >> > >> > think it fine that AMv2 is incomplete come branch time, those > of > > us > > >> > >> working > > >> > >> > on the new AM still need to demonstrate to you all that it > > >> basically > > >> > >> > viable. > > >> > >> > > > >> > >> > The point-of-no-return is commit of the patch in HBASE-14614. > > >> > >> HBASE-14614 > > >> > >> > (AMv2) is coming close to passing all unit tests. We'll spend > > some > > >> > time > > >> > >> > running it on a cluster to make sure it fundamentally sound and > > >> will > > >> > >> report > > >> > >> > back on our experience. There has been an ask for some dev doc > > and > > >> > >> > low-levels on how it works (in progress). Let satisfaction of > > these > > >> > >> > requests be blockers on commit. We'll put the HBASE-14614 > commit > > up > > >> > for > > >> > >> a > > >> > >> > vote on dev list given its import. > > >> > >> > > > >> > >> > Branch will happen after HBASE-14614 goes in (or its rejection) > > >> with > > >> > our > > >> > >> > first alpha soon after. Its looking like a week or two at least > > >> given > > >> > >> how > > >> > >> > things have been going up to this. > > >> > >> > > > >> > >> > I intend to start in on hbase2 stability/perf projects after we > > >> > branch. > > >> > >> > > > >> > >> > Interested in any thoughts you all might have on the above > (Would > > >> also > > >> > >> > appreciate updates on state in [1] if you are a feature owner). > > >> > >> > > > >> > >> > Thanks, > > >> > >> > St.Ack > > >> > >> > > > >> > >> > 1. https://docs.google.com/document/d/1WCsVlnHjJeKUcl7wHwqb4 > > >> > >> > z9iEu_ktczrlKHK8N4SZzs/edit# > > >> > >> > > > >> > >> > > > >> > >> > > > >> > >> > On Sat, Mar 11, 2017 at 5:32 PM, Josh Elser <els...@apache.org > > > > >> > wrote: > > >> > >> > > > >> > >> > > > > >> > >> > > Stack wrote: > > >> > >> > > > > >> > >> > >> On Tue, Mar 7, 2017 at 1:51 PM, Josh Elser< > els...@apache.org> > > >> > >> wrote: > > >> > >> > >> > > >> > >> > >> Thanks for pulling in the FS Quotas work, Stack. I'm trying > to > > >> > cross > > >> > >> the > > >> > >> > >>> last T's and dot the last I's. > > >> > >> > >>> > > >> > >> > >>> The biggest thing I know I need to do still is to write a > new > > >> > >> chapter > > >> > >> > to > > >> > >> > >>> the book. After that, I'd start entertaining larger > > >> > >> reviews/discussions > > >> > >> > >>> to > > >> > >> > >>> merge the feature into master. Anyone with free time > > (giggles) > > >> is > > >> > >> more > > >> > >> > >>> than > > >> > >> > >>> welcome to start perusing :) > > >> > >> > >>> > > >> > >> > >>> > > >> > >> > >>> Out of interest, this could come in after 2.0 Josh? Any 2.0 > > >> > specific > > >> > >> > >> needs > > >> > >> > >> to make this work? > > >> > >> > >> > > >> > >> > >> Meantime, updated the 2.0 doc 1. > > >> > >> > >> > > >> > >> > >> Thanks Josh, > > >> > >> > >> St.Ack > > >> > >> > >> > > >> > >> > >> 1. > > >> > >> > >> https://docs.google.com/document/d/ > 1WCsVlnHjJeKUcl7wHwqb4z9i > > >> > >> > >> Eu_ktczrlKHK8N4SZzs/edit# > > >> > >> > >> > > >> > >> > >> > > >> > >> > > Nope, no need to block 2.0 on this one (given the other, > > related > > >> > >> > chatter). > > >> > >> > > Would be nice to get it in, but I completely understand if it > > >> slips > > >> > :) > > >> > >> > > > > >> > >> > > Thanks for updating the doc for me! > > >> > >> > > > > >> > >> > > > >> > >> > > >> > >> > > >> > >> > > >> > >> -- > > >> > >> Best regards, > > >> > >> > > >> > >> - Andy > > >> > >> > > >> > >> If you are given a choice, you believe you have acted freely. - > > >> Raymond > > >> > >> Teller (via Peter Watts) > > >> > >> > > >> > > > > >> > > > > >> > > > >> > > > > > > > > >