Andrew: HBASE-16255 has been resolved. Kindly provide your feedback.
Thanks On Sat, Aug 20, 2016 at 11:06 AM, Andrew Purtell <andrew.purt...@gmail.com> wrote: > I plan to spin up a test cluster with clusterdock and try running the IT > under a number of different scenarios. I understand snapshots have to > function so baseline would be the calm monkey. > > Unless you have some other automated way for me to run the new > functionality repeatedly, the IT is it. > > > On Aug 20, 2016, at 10:59 AM, Vladimir Rodionov <vladrodio...@gmail.com> > wrote: > > > > Not sure what do you mean, Andrew by "trying out the branch via the IT", > > but we do not recommend running this with monkey enabled. > > It has not been tested in a such scenario yet and frankly speaking it is > > not supposed to work (snapshots will fail anyway and we depends on > > snapshots) > > > > -Vladimir > > > > On Sat, Aug 20, 2016 at 10:29 AM, Andrew Purtell < > andrew.purt...@gmail.com> > > wrote: > > > >> Let's commit the IT to the branch, if you think the v5 patch is ready > for > >> commit Ted. > >> > >> I will be able to spend some time next week trying out the branch via > the > >> IT, and poking around with the new tools. After that I feel like I'll be > >> informed enough to vote on a branch merge vote. > >> > >>> On Aug 19, 2016, at 12:38 PM, Ted Yu <yuzhih...@gmail.com> wrote: > >>> > >>> IT test is provided on HBASE-16255. > >>> > >>> Any other comment ? > >>> > >>> Thanks > >>> > >>>> On Tue, Aug 2, 2016 at 9:09 PM, Dima Spivak <dspi...@cloudera.com> > >> wrote: > >>>> > >>>> Any chance for an IT test being added to the branch first? I'd love to > >> put > >>>> it through the paces with clusterdock to make sure it behaves well > with > >>>> fault injection and the like. > >>>> > >>>> -Dima > >>>> > >>>>> On Tuesday, August 2, 2016, Ted Yu <yuzhih...@gmail.com> wrote: > >>>>> > >>>>> Any more comments from the community on whether the merge can be > >>>> conducted > >>>>> ? > >>>>> > >>>>> Thanks > >>>>> > >>>>> On Mon, Aug 1, 2016 at 12:03 PM, Vladimir Rodionov < > >>>> vladrodio...@gmail.com > >>>>> <javascript:;>> > >>>>> wrote: > >>>>> > >>>>>> Carter Shanklin posted a blog article about the feature: > >>>>>> Some use cases and examples of a command line interface usage. > >>>>> https://hortonworks.com/blog/coming-hdp-2-5-incremental- > >>>> backup-restore-apache-hbase-apache-phoenix/ > >>>>>> > >>>>>> -Vlad > >>>>>> > >>>>>> On Wed, Jul 20, 2016 at 1:25 PM, Vladimir Rodionov < > >>>>> vladrodio...@gmail.com <javascript:;> > >>>>>> wrote: > >>>>>> > >>>>>>> Ok, got it. > >>>>>>> > >>>>>>> -Vlad > >>>>>>> > >>>>>>> On Wed, Jul 20, 2016 at 12:15 PM, Enis Söztutar <e...@apache.org > >>>>> <javascript:;>> wrote: > >>>>>>> > >>>>>>>> We keep the WALs which can accumulate a lot if the use case is to > >>>> only > >>>>>> do > >>>>>>>> backups infrequently. This will definitely cause issues since HDFS > >>>>> space > >>>>>>>> will get filled up. That is why we may need an option for having > >>>>>>>> incremental backups not used, and WAL references being deleted. > >>>>>>>> > >>>>>>>> Enis > >>>>>>>> > >>>>>>>> On Tue, Jul 19, 2016 at 6:33 PM, Vladimir Rodionov < > >>>>>>>> vladrodio...@gmail.com <javascript:;>> > >>>>>>>> wrote: > >>>>>>>> > >>>>>>>>> Why anyone will ever need disabling incremental backups? If you > do > >>>>> not > >>>>>>>> need > >>>>>>>>> it - just run only full backups. > >>>>>>>>> > >>>>>>>>> -Vlad > >>>>>>>>> > >>>>>>>>> On Tue, Jul 19, 2016 at 6:21 PM, Enis Söztutar <e...@apache.org > >>>>> <javascript:;>> > >>>>>> wrote: > >>>>>>>>> > >>>>>>>>>> Thanks Matteo for chiming in. > >>>>>>>>>> > >>>>>>>>>> On Tue, Jul 19, 2016 at 5:02 PM, Matteo Bertozzi < > >>>>>>>>> theo.berto...@gmail.com <javascript:;>> > >>>>>>>>>> wrote: > >>>>>>>>>> > >>>>>>>>>>> I did some review in the early beginning, but then lost track > >>>> of > >>>>>> the > >>>>>>>>>>> changes. > >>>>>>>>>>> but I'd like to give a quick review to the full code once > >>>> people > >>>>>>>> here > >>>>>>>>> are > >>>>>>>>>>> ok with getting this feature in master (2.0). > >>>>>>>>>>> (let say we put a deadline for reviews, like 1 week for > >>>>> reviewing > >>>>>>>> the > >>>>>>>>>> full > >>>>>>>>>>> stuff after everyone agrees to get this in. just to avoid > >>>>> holding > >>>>>>>> this > >>>>>>>>>> for > >>>>>>>>>>> too long, but still enough time to have people that are > >>>>> interested > >>>>>>>> to > >>>>>>>>>> look > >>>>>>>>>>> at it. with did the same thing for MOB with a mega patch > >>>>>>>>>>> https://reviews.apache.org/r/36391/) > >>>>>>>>>> > >>>>>>>>>> This sounds good. Vladimir / Ted how do you guys want to handle > >>>>> the > >>>>>>>>> merge? > >>>>>>>>>> As a giant patch or a rebase of code in the branch and through > >>>> git > >>>>>>>> merge. > >>>>>>>>>> > >>>>>>>>>> We need to run a vote when the to-be-merged branched is ready. > >>>> We > >>>>>> can > >>>>>>>>> set a > >>>>>>>>>> vote timeout for at least 1 week. > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> most of the code seemed isolated from the beginning, few > >>>> changes > >>>>>>>> here > >>>>>>>>> and > >>>>>>>>>>> there in the core. > >>>>>>>>>>> so, this side of things seems ok to me. > >>>>>>>>>>> > >>>>>>>>>>> maybe some work to add IT tests as mentioned above, but that > >>>>>> should > >>>>>>>> not > >>>>>>>>>>> take long. > >>>>>>>>>>> > >>>>>>>>>>> I don't know if there are already docs, but that is another > >>>>> thing > >>>>>> we > >>>>>>>>> may > >>>>>>>>>>> want to get in with the merge. > >>>>>>>>>>> a minimal coverage at least on how to use the feature, and > >>>> maybe > >>>>>>>>> calling > >>>>>>>>>> it > >>>>>>>>>>> out as experimental? > >>>>>>>>>>> > >>>>>>>>>>> my main concern were around incremental backups. > >>>>>>>>>>> I'm still not convinced around the fact that because the WALs > >>>>>>>> contain > >>>>>>>>>>> regions of multiple tables > >>>>>>>>>>> the incremental backup will keep around WALs with some data > >>>> that > >>>>>> we > >>>>>>>>> don't > >>>>>>>>>>> really want in the backup (for space or maybe security > >>>> reason). > >>>>>>>>>>> > >>>>>>>>>>> then there was the question about for how long should I take > >>>>>>>>>> incrementals, > >>>>>>>>>>> before deciding that a fresh full backup is less costly in > >>>> terms > >>>>>> of > >>>>>>>>>> space. > >>>>>>>>>>> but I think this incremental merge/compaction was a feature on > >>>>> the > >>>>>>>>>> roadmap > >>>>>>>>>>> as Phase3. > >>>>>>>>>>> which I think is ok to get later on, > >>>>>>>>>>> maybe just call out a lifecycle example on the docs under > >>>> "best > >>>>>>>>>> practices". > >>>>>>>>>> > >>>>>>>>>> I think this will depend on the use case, and other factors like > >>>>>>>>> bandwidth > >>>>>>>>>> available, how much data > >>>>>>>>>> the user is willing to lose in case of catastrophic failure and > >>>>> how > >>>>>>>>>> "expensive" is full backup versus > >>>>>>>>>> incremental one. > >>>>>>>>>> > >>>>>>>>>> The full backup should also be useable by default, so maybe we > >>>> can > >>>>>>>> make > >>>>>>>>> an > >>>>>>>>>> option to not even keep WAL files, and completely disable > >>>>>> incremental > >>>>>>>>>> backups? > >>>>>>>>>> > >>>>>>>>>> Enis > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> has anyone interested in using backups looked at the doc in > >>>>>>>> HBASE-7912? > >>>>>>>>>>> is the current design of incremental backup acceptable for > >>>>>> everyone > >>>>>>>>>> wanting > >>>>>>>>>>> to use this feature? > >>>>>>>>>>> (maybe this should be a question for the @user list and not > >>>> dev) > >>>>>>>>>>> > >>>>>>>>>>> is there anyone already using this feature or it is just dev > >>>>>> testing > >>>>>>>>> it? > >>>>>>>>>>> to me will be interesting having a use-case/workflow example, > >>>>>>>>>>> to see if in the real world my concerns about incremental are > >>>>> not > >>>>>>>>> showing > >>>>>>>>>>> up. > >>>>>>>>>>> > >>>>>>>>>>> On Tue, Jul 19, 2016 at 1:35 PM, Ted Yu <yuzhih...@gmail.com > >>>>> <javascript:;>> > >>>>>>>> wrote: > >>>>>>>>>>> > >>>>>>>>>>>> Gentle ping on this subject. > >>>>>>>>>>>> > >>>>>>>>>>>> The changes are mostly non-intrusive. > >>>>>>>>>>>> > >>>>>>>>>>>> More comments are welcome. > >>>>>>>>>>>> > >>>>>>>>>>>> On Mon, Jul 11, 2016 at 9:29 PM, Vladimir Rodionov < > >>>>>>>>>>> vladrodio...@gmail.com <javascript:;> > >>>>>>>>>>>> wrote: > >>>>>>>>>>>> > >>>>>>>>>>>>> Not that hard, Andrew. I will open JIRA. > >>>>>>>>>>>>> > >>>>>>>>>>>>> -Vlad > >>>>>>>>>>>>> > >>>>>>>>>>>>> On Mon, Jul 11, 2016 at 8:46 PM, Andrew Purtell < > >>>>>>>>>>>> andrew.purt...@gmail.com <javascript:;>> > >>>>>>>>>>>>> wrote: > >>>>>>>>>>>>> > >>>>>>>>>>>>>> How hard would it be to convert what you've been using > >>>> to > >>>>>> test > >>>>>>>>> end > >>>>>>>>>> to > >>>>>>>>>>>> end > >>>>>>>>>>>>>> during dev into an IT? > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> On Jul 11, 2016, at 5:31 PM, Vladimir Rodionov < > >>>>>>>>>>> vladrodio...@gmail.com <javascript:;> > >>>>>>>>>>>>> > >>>>>>>>>>>>>> wrote: > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If > >>>> not, > >>>>>> any > >>>>>>>>> tips > >>>>>>>>>>> on a > >>>>>>>>>>>>>>>>> semi-automateable way to take backups and restore > >>>>> them? > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> We do not have yet, but we have a lot of unit tests. > >>>> We > >>>>>>>>> provide 2 > >>>>>>>>>>> API > >>>>>>>>>>>>> for > >>>>>>>>>>>>>>> backup: > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> 1. Admin.getBackupAdmin > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> 2. Command - line via hbase command. > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> Everything is straightforward. > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> -Vlad > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 5:23 PM, Dima Spivak < > >>>>>>>>>>> dspi...@cloudera.com <javascript:;>> > >>>>>>>>>>>>>> wrote: > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If not, > >>>>> any > >>>>>>>> tips > >>>>>>>>>> on > >>>>>>>>>>> a > >>>>>>>>>>>>>>>> semi-automateable way to take backups and restore > >>>> them? > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>> -Dima > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 6:42 PM, Vladimir Rodionov < > >>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> > >>>>>>>>>>>>>>>> wrote: > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>> Sorry, wrong links: > >>>>>>>>>>>>>>>>> These are the phases: > >>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>> Phase 1: > >>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14030 > >>>>>>> 14030 > >>>>>>>>>>>>>>>>> Phase 2: > >>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14123 > >>>>>>> 14123 > >>>>>>>>>>>>>>>>> Phase 3: > >>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14414 > >>>>>>> 14414 > >>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>> -Vlad > >>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 4:41 PM, Vladimir Rodionov < > >>>>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;> > >>>>>>>>>>>>>>>>> wrote: > >>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>> These are the phases: > >>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>> Phase 1: > >>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 > >>>>>>> 14030 > >>>>>>>>>>>>>>>>>> Phase 2: > >>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 > >>>>>>> 14123 > >>>>>>>>>>>>>>>>>> Phase 3: > >>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE- > >>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912 > >>>>>>> 14414 > >>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>> -Vlad > >>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 12:21 PM, Enis Söztutar < > >>>>>>>>>>> e...@apache.org <javascript:;>> > >>>>>>>>>>>>>>>> wrote: > >>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> As you guys may already be familiar, Vladimir, > >>>> Ted, > >>>>>>>> Jerry > >>>>>>>>> and > >>>>>>>>>>>>> others > >>>>>>>>>>>>>>>>> have > >>>>>>>>>>>>>>>>>>> been developing the backup / restore functionality > >>>>> in > >>>>>> a > >>>>>>>>>> series > >>>>>>>>>>> of > >>>>>>>>>>>>>>>> issues > >>>>>>>>>>>>>>>>>>> committed in the separate branch HBASE-7912[1]. > >>>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> Backup / Restore functionality is tracked as a > >>>>> 4-phase > >>>>>>>>>> project, > >>>>>>>>>>>> and > >>>>>>>>>>>>>>>> the > >>>>>>>>>>>>>>>>>>> first two phases are complete and useable. We are > >>>>> now > >>>>>>>>> working > >>>>>>>>>>> on > >>>>>>>>>>>>>>>> Phase 3 > >>>>>>>>>>>>>>>>>>> items, which are mostly improvements. We think > >>>> that > >>>>>> the > >>>>>>>>>> current > >>>>>>>>>>>>> code > >>>>>>>>>>>>>>>> in > >>>>>>>>>>>>>>>>>>> the > >>>>>>>>>>>>>>>>>>> branch containing all Phase 1 and Phase 2 items, > >>>> and > >>>>>>>> some > >>>>>>>>>>> Phase 3 > >>>>>>>>>>>>>>>> items > >>>>>>>>>>>>>>>>> is > >>>>>>>>>>>>>>>>>>> useable on it's own, and we do not have to wait > >>>> for > >>>>>> all > >>>>>>>> the > >>>>>>>>>>>>>> subtickets > >>>>>>>>>>>>>>>>> to > >>>>>>>>>>>>>>>>>>> be finished to make it completely useable (as > >>>> follow > >>>>>> up > >>>>>>>>>> tickets > >>>>>>>>>>>> are > >>>>>>>>>>>>>>>>> mostly > >>>>>>>>>>>>>>>>>>> improvements or optimizations). The improvements > >>>> in > >>>>>> the > >>>>>>>>> works > >>>>>>>>>>> are > >>>>>>>>>>>>> all > >>>>>>>>>>>>>>>>>>> backwards compatible with the existing stuff. > >>>> Thus, > >>>>> we > >>>>>>>>> would > >>>>>>>>>>> like > >>>>>>>>>>>>> to > >>>>>>>>>>>>>>>>>>> propose that the branch HBASE-7912 be merged into > >>>>>>>> master. > >>>>>>>>>> The > >>>>>>>>>>>>> parent > >>>>>>>>>>>>>>>>> jira > >>>>>>>>>>>>>>>>>>> has a design doc that goes into details about the > >>>>>>>>>>> implementation > >>>>>>>>>>>>> and > >>>>>>>>>>>>>>>>>>> design > >>>>>>>>>>>>>>>>>>> choices in case you are interested[2]. > >>>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> Most of the changes are largely non-intrusive and > >>>>>>>> confined > >>>>>>>>> to > >>>>>>>>>>> the > >>>>>>>>>>>>>>>>>>> backup subsystem. > >>>>>>>>>>>>>>>>>>> The unit tests have been passing on manual runs > >>>> and > >>>>> we > >>>>>>>>>>>>> (hortonworks) > >>>>>>>>>>>>>>>>> have > >>>>>>>>>>>>>>>>>>> been running the integration tests as well as some > >>>>>> other > >>>>>>>>>>>>> shell-based > >>>>>>>>>>>>>>>>>>> system > >>>>>>>>>>>>>>>>>>> tests on a forked version of the code. Most of the > >>>>>> work > >>>>>>>> has > >>>>>>>>>>> been > >>>>>>>>>>>>>>>>> reviewed > >>>>>>>>>>>>>>>>>>> by 1, 2 or 3 committers already (mostly Ted, > >>>> myself > >>>>>> and > >>>>>>>>>> Jerry). > >>>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> What do you guys think? Is it time to call a vote? > >>>>> Any > >>>>>>>>>> concerns > >>>>>>>>>>>> or > >>>>>>>>>>>>>>>>>>> feedback > >>>>>>>>>>>>>>>>>>> appreciated. > >>>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> [1] > >>>>> https://issues.apache.org/jira/browse/HBASE-7912 > >>>>>>>>>>>>>>>>>>> [2] > >>>>> https://issues.apache.org/jira/secure/attachment/12816339/ > >>>> HBaseBackupAndRestore%20-0.91.pdf > >>>>>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>>>> Enis > >>>> > >>>> > >>>> -- > >>>> -Dima > >> >