Busy at work, aiming for next week.
> On Sep 1, 2016, at 8:44 AM, Ted Yu <yuzhih...@gmail.com> wrote: > > Andrew: > HBASE-16255 has been resolved. > > Kindly provide your feedback. > > Thanks > > On Sat, Aug 20, 2016 at 11:06 AM, Andrew Purtell <andrew.purt...@gmail.com> > wrote: > >> I plan to spin up a test cluster with clusterdock and try running the IT >> under a number of different scenarios. I understand snapshots have to >> function so baseline would be the calm monkey. >> >> Unless you have some other automated way for me to run the new >> functionality repeatedly, the IT is it. >> >>>> On Aug 20, 2016, at 10:59 AM, Vladimir Rodionov <vladrodio...@gmail.com> >>> wrote: >>> >>> Not sure what do you mean, Andrew by "trying out the branch via the IT", >>> but we do not recommend running this with monkey enabled. >>> It has not been tested in a such scenario yet and frankly speaking it is >>> not supposed to work (snapshots will fail anyway and we depends on >>> snapshots) >>> >>> -Vladimir >>> >>> On Sat, Aug 20, 2016 at 10:29 AM, Andrew Purtell < >> andrew.purt...@gmail.com> >>> wrote: >>> >>>> Let's commit the IT to the branch, if you think the v5 patch is ready >> for >>>> commit Ted. >>>> >>>> I will be able to spend some time next week trying out the branch via >> the >>>> IT, and poking around with the new tools. After that I feel like I'll be >>>> informed enough to vote on a branch merge vote. >>>> >>>>> On Aug 19, 2016, at 12:38 PM, Ted Yu <yuzhih...@gmail.com> wrote: >>>>> >>>>> IT test is provided on HBASE-16255. >>>>> >>>>> Any other comment ? >>>>> >>>>> Thanks >>>>> >>>>>> On Tue, Aug 2, 2016 at 9:09 PM, Dima Spivak <dspi...@cloudera.com> >>>> wrote: >>>>>> >>>>>> Any chance for an IT test being added to the branch first? I'd love to >>>> put >>>>>> it through the paces with clusterdock to make sure it behaves well >> with >>>>>> fault injection and the like. >>>>>> >>>>>> -Dima >>>>>> >>>>>>> On Tuesday, August 2, 2016, Ted Yu <yuzhih...@gmail.com> wrote: >>>>>>> >>>>>>> Any more comments from the community on whether the merge can be >>>>>> conducted >>>>>>> ? >>>>>>> >>>>>>> Thanks >>>>>>> >>>>>>> On Mon, Aug 1, 2016 at 12:03 PM, Vladimir Rodionov < >>>>>> vladrodio...@gmail.com >>>>>>> <javascript:;>> >>>>>>> wrote: >>>>>>> >>>>>>>> Carter Shanklin posted a blog article about the feature: >>>>>>>> Some use cases and examples of a command line interface usage. >>>>>>> https://hortonworks.com/blog/coming-hdp-2-5-incremental- >>>>>> backup-restore-apache-hbase-apache-phoenix/ >>>>>>>> >>>>>>>> -Vlad >>>>>>>> >>>>>>>> On Wed, Jul 20, 2016 at 1:25 PM, Vladimir Rodionov < >>>>>>> vladrodio...@gmail.com <javascript:;> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Ok, got it. >>>>>>>>> >>>>>>>>> -Vlad >>>>>>>>> >>>>>>>>> On Wed, Jul 20, 2016 at 12:15 PM, Enis Söztutar <e...@apache.org >>>>>>> <javascript:;>> wrote: >>>>>>>>> >>>>>>>>>> We keep the WALs which can accumulate a lot if the use case is to >>>>>> only >>>>>>>> do >>>>>>>>>> backups infrequently. This will definitely cause issues since HDFS >>>>>>> space >>>>>>>>>> will get filled up. That is why we may need an option for having >>>>>>>>>> incremental backups not used, and WAL references being deleted. >>>>>>>>>> >>>>>>>>>> Enis >>>>>>>>>> >>>>>>>>>> On Tue, Jul 19, 2016 at 6:33 PM, Vladimir Rodionov < >>>>>>>>>> vladrodio...@gmail.com <javascript:;>> >>>>>>>>>> wrote: >>>>>>>>>> >>>>>>>>>>> Why anyone will ever need disabling incremental backups? If you >> do >>>>>>> not >>>>>>>>>> need >>>>>>>>>>> it - just run only full backups. >>>>>>>>>>> >>>>>>>>>>> -Vlad >>>>>>>>>>> >>>>>>>>>>> On Tue, Jul 19, 2016 at 6:21 PM, Enis Söztutar <e...@apache.org >>>>>>> <javascript:;>> >>>>>>>> wrote: >>>>>>>>>>> >>>>>>>>>>>> Thanks Matteo for chiming in. >>>>>>>>>>>> >>>>>>>>>>>> On Tue, Jul 19, 2016 at 5:02 PM, Matteo Bertozzi < >>>>>>>>>>> theo.berto...@gmail.com <javascript:;>> >>>>>>>>>>>> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> I did some review in the early beginning, but then lost track >>>>>> of >>>>>>>> the >>>>>>>>>>>>> changes. >>>>>>>>>>>>> but I'd like to give a quick review to the full code once >>>>>> people >>>>>>>>>> here >>>>>>>>>>> are >>>>>>>>>>>>> ok with getting this feature in master (2.0). >>>>>>>>>>>>> (let say we put a deadline for reviews, like 1 week for >>>>>>> reviewing >>>>>>>>>> the >>>>>>>>>>>> full >>>>>>>>>>>>> stuff after everyone agrees to get this in. just to avoid >>>>>>> holding >>>>>>>>>> this >>>>>>>>>>>> for >>>>>>>>>>>>> too long, but still enough time to have people that are >>>>>>> interested >>>>>>>>>> to >>>>>>>>>>>> look >>>>>>>>>>>>> at it. with did the same thing for MOB with a mega patch >>>>>>>>>>>>> https://reviews.apache.org/r/36391/) >>>>>>>>>>>> >>>>>>>>>>>> This sounds good. Vladimir / Ted how do you guys want to handle >>>>>>> the >>>>>>>>>>> merge? >>>>>>>>>>>> As a giant patch or a rebase of code in the branch and through >>>>>> git >>>>>>>>>> merge. >>>>>>>>>>>> >>>>>>>>>>>> We need to run a vote when the to-be-merged branched is ready. >>>>>> We >>>>>>>> can >>>>>>>>>>> set a >>>>>>>>>>>> vote timeout for at least 1 week. >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> most of the code seemed isolated from the beginning, few >>>>>> changes >>>>>>>>>> here >>>>>>>>>>> and >>>>>>>>>>>>> there in the core. >>>>>>>>>>>>> so, this side of things seems ok to me. >>>>>>>>>>>>> >>>>>>>>>>>>> maybe some work to add IT tests as mentioned above, but that >>>>>>>> should >>>>>>>>>> not >>>>>>>>>>>>> take long. >>>>>>>>>>>>> >>>>>>>>>>>>> I don't know if there are already docs, but that is another >>>>>>> thing >>>>>>>> we >>>>>>>>>>> may >>>>>>>>>>>>> want to get in with the merge. >>>>>>>>>>>>> a minimal coverage at least on how to use the feature, and >>>>>> maybe >>>>>>>>>>> calling >>>>>>>>>>>> it >>>>>>>>>>>>> out as experimental? >>>>>>>>>>>>> >>>>>>>>>>>>> my main concern were around incremental backups. >>>>>>>>>>>>> I'm still not convinced around the fact that because the WALs >>>>>>>>>> contain >>>>>>>>>>>>> regions of multiple tables >>>>>>>>>>>>> the incremental backup will keep around WALs with some data >>>>>> that >>>>>>>> we >>>>>>>>>>> don't >>>>>>>>>>>>> really want in the backup (for space or maybe security >>>>>> reason). >>>>>>>>>>>>> >>>>>>>>>>>>> then there was the question about for how long should I take >>>>>>>>>>>> incrementals, >>>>>>>>>>>>> before deciding that a fresh full backup is less costly in >>>>>> terms >>>>>>>> of >>>>>>>>>>>> space. >>>>>>>>>>>>> but I think this incremental merge/compaction was a feature on >>>>>>> the >>>>>>>>>>>> roadmap >>>>>>>>>>>>> as Phase3. >>>>>>>>>>>>> which I think is ok to get later on, >>>>>>>>>>>>> maybe just call out a lifecycle example on the docs under >>>>>> "best >>>>>>>>>>>> practices". >>>>>>>>>>>> >>>>>>>>>>>> I think this will depend on the use case, and other factors like >>>>>>>>>>> bandwidth >>>>>>>>>>>> available, how much data >>>>>>>>>>>> the user is willing to lose in case of catastrophic failure and >>>>>>> how >>>>>>>>>>>> "expensive" is full backup versus >>>>>>>>>>>> incremental one. >>>>>>>>>>>> >>>>>>>>>>>> The full backup should also be useable by default, so maybe we >>>>>> can >>>>>>>>>> make >>>>>>>>>>> an >>>>>>>>>>>> option to not even keep WAL files, and completely disable >>>>>>>> incremental >>>>>>>>>>>> backups? >>>>>>>>>>>> >>>>>>>>>>>> Enis >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> has anyone interested in using backups looked at the doc in >>>>>>>>>> HBASE-7912? >>>>>>>>>>>>> is the current design of incremental backup acceptable for >>>>>>>> everyone >>>>>>>>>>>> wanting >>>>>>>>>>>>> to use this feature? >>>>>>>>>>>>> (maybe this should be a question for the @user list and not >>>>>> dev) >>>>>>>>>>>>> >>>>>>>>>>>>> is there anyone already using this feature or it is just dev >>>>>>>> testing >>>>>>>>>>> it? >>>>>>>>>>>>> to me will be interesting having a use-case/workflow example, >>>>>>>>>>>>> to see if in the real world my concerns about incremental are >>>>>>> not >>>>>>>>>>> showing >>>>>>>>>>>>> up. >>>>>>>>>>>>> >>>>>>>>>>>>> On Tue, Jul 19, 2016 at 1:35 PM, Ted Yu <yuzhih...@gmail.com >>>>>>> <javascript:;>> >>>>>>>>>> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> Gentle ping on this subject. >>>>>>>>>>>>>> >>>>>>>>>>>>>> The changes are mostly non-intrusive. >>>>>>>>>>>>>> >>>>>>>>>>>>>> More comments are welcome. >>>>>>>>>>>>>> >>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 9:29 PM, Vladimir Rodionov < >>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> >>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Not that hard, Andrew. I will open JIRA. >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> -Vlad >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 8:46 PM, Andrew Purtell < >>>>>>>>>>>>>> andrew.purt...@gmail.com <javascript:;>> >>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> How hard would it be to convert what you've been using >>>>>> to >>>>>>>> test >>>>>>>>>>> end >>>>>>>>>>>> to >>>>>>>>>>>>>> end >>>>>>>>>>>>>>>> during dev into an IT? >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On Jul 11, 2016, at 5:31 PM, Vladimir Rodionov < >>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If >>>>>> not, >>>>>>>> any >>>>>>>>>>> tips >>>>>>>>>>>>> on a >>>>>>>>>>>>>>>>>>> semi-automateable way to take backups and restore >>>>>>> them? >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> We do not have yet, but we have a lot of unit tests. >>>>>> We >>>>>>>>>>> provide 2 >>>>>>>>>>>>> API >>>>>>>>>>>>>>> for >>>>>>>>>>>>>>>>> backup: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> 1. Admin.getBackupAdmin >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> 2. Command - line via hbase command. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Everything is straightforward. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> -Vlad >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 5:23 PM, Dima Spivak < >>>>>>>>>>>>> dspi...@cloudera.com <javascript:;>> >>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If not, >>>>>>> any >>>>>>>>>> tips >>>>>>>>>>>> on >>>>>>>>>>>>> a >>>>>>>>>>>>>>>>>> semi-automateable way to take backups and restore >>>>>> them? >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> -Dima >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 6:42 PM, Vladimir Rodionov < >>>>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> >>>>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Sorry, wrong links: >>>>>>>>>>>>>>>>>>> These are the phases: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Phase 1: >>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14030 >>>>>>>>> 14030 >>>>>>>>>>>>>>>>>>> Phase 2: >>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14123 >>>>>>>>> 14123 >>>>>>>>>>>>>>>>>>> Phase 3: >>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14414 >>>>>>>>> 14414 >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> -Vlad >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 4:41 PM, Vladimir Rodionov < >>>>>>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> >>>>>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> These are the phases: >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> Phase 1: >>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 >>>>>>>>> 14030 >>>>>>>>>>>>>>>>>>>> Phase 2: >>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 >>>>>>>>> 14123 >>>>>>>>>>>>>>>>>>>> Phase 3: >>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- >>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 >>>>>>>>> 14414 >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> -Vlad >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 12:21 PM, Enis Söztutar < >>>>>>>>>>>>> e...@apache.org <javascript:;>> >>>>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> As you guys may already be familiar, Vladimir, >>>>>> Ted, >>>>>>>>>> Jerry >>>>>>>>>>> and >>>>>>>>>>>>>>> others >>>>>>>>>>>>>>>>>>> have >>>>>>>>>>>>>>>>>>>>> been developing the backup / restore functionality >>>>>>> in >>>>>>>> a >>>>>>>>>>>> series >>>>>>>>>>>>> of >>>>>>>>>>>>>>>>>> issues >>>>>>>>>>>>>>>>>>>>> committed in the separate branch HBASE-7912[1]. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Backup / Restore functionality is tracked as a >>>>>>> 4-phase >>>>>>>>>>>> project, >>>>>>>>>>>>>> and >>>>>>>>>>>>>>>>>> the >>>>>>>>>>>>>>>>>>>>> first two phases are complete and useable. We are >>>>>>> now >>>>>>>>>>> working >>>>>>>>>>>>> on >>>>>>>>>>>>>>>>>> Phase 3 >>>>>>>>>>>>>>>>>>>>> items, which are mostly improvements. We think >>>>>> that >>>>>>>> the >>>>>>>>>>>> current >>>>>>>>>>>>>>> code >>>>>>>>>>>>>>>>>> in >>>>>>>>>>>>>>>>>>>>> the >>>>>>>>>>>>>>>>>>>>> branch containing all Phase 1 and Phase 2 items, >>>>>> and >>>>>>>>>> some >>>>>>>>>>>>> Phase 3 >>>>>>>>>>>>>>>>>> items >>>>>>>>>>>>>>>>>>> is >>>>>>>>>>>>>>>>>>>>> useable on it's own, and we do not have to wait >>>>>> for >>>>>>>> all >>>>>>>>>> the >>>>>>>>>>>>>>>> subtickets >>>>>>>>>>>>>>>>>>> to >>>>>>>>>>>>>>>>>>>>> be finished to make it completely useable (as >>>>>> follow >>>>>>>> up >>>>>>>>>>>> tickets >>>>>>>>>>>>>> are >>>>>>>>>>>>>>>>>>> mostly >>>>>>>>>>>>>>>>>>>>> improvements or optimizations). The improvements >>>>>> in >>>>>>>> the >>>>>>>>>>> works >>>>>>>>>>>>> are >>>>>>>>>>>>>>> all >>>>>>>>>>>>>>>>>>>>> backwards compatible with the existing stuff. >>>>>> Thus, >>>>>>> we >>>>>>>>>>> would >>>>>>>>>>>>> like >>>>>>>>>>>>>>> to >>>>>>>>>>>>>>>>>>>>> propose that the branch HBASE-7912 be merged into >>>>>>>>>> master. >>>>>>>>>>>> The >>>>>>>>>>>>>>> parent >>>>>>>>>>>>>>>>>>> jira >>>>>>>>>>>>>>>>>>>>> has a design doc that goes into details about the >>>>>>>>>>>>> implementation >>>>>>>>>>>>>>> and >>>>>>>>>>>>>>>>>>>>> design >>>>>>>>>>>>>>>>>>>>> choices in case you are interested[2]. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Most of the changes are largely non-intrusive and >>>>>>>>>> confined >>>>>>>>>>> to >>>>>>>>>>>>> the >>>>>>>>>>>>>>>>>>>>> backup subsystem. >>>>>>>>>>>>>>>>>>>>> The unit tests have been passing on manual runs >>>>>> and >>>>>>> we >>>>>>>>>>>>>>> (hortonworks) >>>>>>>>>>>>>>>>>>> have >>>>>>>>>>>>>>>>>>>>> been running the integration tests as well as some >>>>>>>> other >>>>>>>>>>>>>>> shell-based >>>>>>>>>>>>>>>>>>>>> system >>>>>>>>>>>>>>>>>>>>> tests on a forked version of the code. Most of the >>>>>>>> work >>>>>>>>>> has >>>>>>>>>>>>> been >>>>>>>>>>>>>>>>>>> reviewed >>>>>>>>>>>>>>>>>>>>> by 1, 2 or 3 committers already (mostly Ted, >>>>>> myself >>>>>>>> and >>>>>>>>>>>> Jerry). >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> What do you guys think? Is it time to call a vote? >>>>>>> Any >>>>>>>>>>>> concerns >>>>>>>>>>>>>> or >>>>>>>>>>>>>>>>>>>>> feedback >>>>>>>>>>>>>>>>>>>>> appreciated. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> [1] >>>>>>> https://issues.apache.org/jira/browse/HBASE-7912 >>>>>>>>>>>>>>>>>>>>> [2] >>>>>>> https://issues.apache.org/jira/secure/attachment/12816339/ >>>>>> HBaseBackupAndRestore%20-0.91.pdf >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Enis >>>>>> >>>>>> >>>>>> -- >>>>>> -Dima >>