Busy at work, aiming for next week. 

> On Sep 1, 2016, at 8:44 AM, Ted Yu <yuzhih...@gmail.com> wrote:
> 
> Andrew:
> HBASE-16255 has been resolved.
> 
> Kindly provide your feedback.
> 
> Thanks
> 
> On Sat, Aug 20, 2016 at 11:06 AM, Andrew Purtell <andrew.purt...@gmail.com>
> wrote:
> 
>> I plan to spin up a test cluster with clusterdock and try running the IT
>> under a number of different scenarios. I understand snapshots have to
>> function so baseline would be the calm monkey.
>> 
>> Unless you have some other automated way for me to run the new
>> functionality repeatedly, the IT is it.
>> 
>>>> On Aug 20, 2016, at 10:59 AM, Vladimir Rodionov <vladrodio...@gmail.com>
>>> wrote:
>>> 
>>> Not sure what do you mean, Andrew by "trying out the branch via the IT",
>>> but we do not recommend running this with monkey enabled.
>>> It has not been tested in a such scenario yet and frankly speaking it is
>>> not supposed to work (snapshots will fail anyway and we depends on
>>> snapshots)
>>> 
>>> -Vladimir
>>> 
>>> On Sat, Aug 20, 2016 at 10:29 AM, Andrew Purtell <
>> andrew.purt...@gmail.com>
>>> wrote:
>>> 
>>>> Let's commit the IT to the branch, if you think the v5 patch is ready
>> for
>>>> commit Ted.
>>>> 
>>>> I will be able to spend some time next week trying out the branch via
>> the
>>>> IT, and poking around with the new tools. After that I feel like I'll be
>>>> informed enough to vote on a branch merge vote.
>>>> 
>>>>> On Aug 19, 2016, at 12:38 PM, Ted Yu <yuzhih...@gmail.com> wrote:
>>>>> 
>>>>> IT test is provided on HBASE-16255.
>>>>> 
>>>>> Any other comment ?
>>>>> 
>>>>> Thanks
>>>>> 
>>>>>> On Tue, Aug 2, 2016 at 9:09 PM, Dima Spivak <dspi...@cloudera.com>
>>>> wrote:
>>>>>> 
>>>>>> Any chance for an IT test being added to the branch first? I'd love to
>>>> put
>>>>>> it through the paces with clusterdock to make sure it behaves well
>> with
>>>>>> fault injection and the like.
>>>>>> 
>>>>>> -Dima
>>>>>> 
>>>>>>> On Tuesday, August 2, 2016, Ted Yu <yuzhih...@gmail.com> wrote:
>>>>>>> 
>>>>>>> Any more comments from the community on whether the merge can be
>>>>>> conducted
>>>>>>> ?
>>>>>>> 
>>>>>>> Thanks
>>>>>>> 
>>>>>>> On Mon, Aug 1, 2016 at 12:03 PM, Vladimir Rodionov <
>>>>>> vladrodio...@gmail.com
>>>>>>> <javascript:;>>
>>>>>>> wrote:
>>>>>>> 
>>>>>>>> Carter Shanklin posted a blog article about the feature:
>>>>>>>> Some use cases and examples of a command line interface usage.
>>>>>>> https://hortonworks.com/blog/coming-hdp-2-5-incremental-
>>>>>> backup-restore-apache-hbase-apache-phoenix/
>>>>>>>> 
>>>>>>>> -Vlad
>>>>>>>> 
>>>>>>>> On Wed, Jul 20, 2016 at 1:25 PM, Vladimir Rodionov <
>>>>>>> vladrodio...@gmail.com <javascript:;>
>>>>>>>> wrote:
>>>>>>>> 
>>>>>>>>> Ok, got it.
>>>>>>>>> 
>>>>>>>>> -Vlad
>>>>>>>>> 
>>>>>>>>> On Wed, Jul 20, 2016 at 12:15 PM, Enis Söztutar <e...@apache.org
>>>>>>> <javascript:;>> wrote:
>>>>>>>>> 
>>>>>>>>>> We keep the WALs which can accumulate a lot if the use case is to
>>>>>> only
>>>>>>>> do
>>>>>>>>>> backups infrequently. This will definitely cause issues since HDFS
>>>>>>> space
>>>>>>>>>> will get filled up. That is why we may need an option for having
>>>>>>>>>> incremental backups not used, and WAL references being deleted.
>>>>>>>>>> 
>>>>>>>>>> Enis
>>>>>>>>>> 
>>>>>>>>>> On Tue, Jul 19, 2016 at 6:33 PM, Vladimir Rodionov <
>>>>>>>>>> vladrodio...@gmail.com <javascript:;>>
>>>>>>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>>> Why anyone will ever need disabling incremental backups? If you
>> do
>>>>>>> not
>>>>>>>>>> need
>>>>>>>>>>> it - just run only full backups.
>>>>>>>>>>> 
>>>>>>>>>>> -Vlad
>>>>>>>>>>> 
>>>>>>>>>>> On Tue, Jul 19, 2016 at 6:21 PM, Enis Söztutar <e...@apache.org
>>>>>>> <javascript:;>>
>>>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> Thanks Matteo for chiming in.
>>>>>>>>>>>> 
>>>>>>>>>>>> On Tue, Jul 19, 2016 at 5:02 PM, Matteo Bertozzi <
>>>>>>>>>>> theo.berto...@gmail.com <javascript:;>>
>>>>>>>>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> I did some review in the early beginning, but then lost track
>>>>>> of
>>>>>>>> the
>>>>>>>>>>>>> changes.
>>>>>>>>>>>>> but I'd like to give a quick review to the full code once
>>>>>> people
>>>>>>>>>> here
>>>>>>>>>>> are
>>>>>>>>>>>>> ok with getting this feature in master (2.0).
>>>>>>>>>>>>> (let say we put a deadline for reviews, like 1 week for
>>>>>>> reviewing
>>>>>>>>>> the
>>>>>>>>>>>> full
>>>>>>>>>>>>> stuff after everyone agrees to get this in. just to avoid
>>>>>>> holding
>>>>>>>>>> this
>>>>>>>>>>>> for
>>>>>>>>>>>>> too long, but still enough time to have people that are
>>>>>>> interested
>>>>>>>>>> to
>>>>>>>>>>>> look
>>>>>>>>>>>>> at it. with did the same thing for MOB with a mega patch
>>>>>>>>>>>>> https://reviews.apache.org/r/36391/)
>>>>>>>>>>>> 
>>>>>>>>>>>> This sounds good. Vladimir / Ted how do you guys want to handle
>>>>>>> the
>>>>>>>>>>> merge?
>>>>>>>>>>>> As a giant patch or a rebase of code in the branch and through
>>>>>> git
>>>>>>>>>> merge.
>>>>>>>>>>>> 
>>>>>>>>>>>> We need to run a vote when the to-be-merged branched is ready.
>>>>>> We
>>>>>>>> can
>>>>>>>>>>> set a
>>>>>>>>>>>> vote timeout for at least 1 week.
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> most of the code seemed isolated from the beginning, few
>>>>>> changes
>>>>>>>>>> here
>>>>>>>>>>> and
>>>>>>>>>>>>> there in the core.
>>>>>>>>>>>>> so, this side of things seems ok to me.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> maybe some work to add IT tests as mentioned above, but that
>>>>>>>> should
>>>>>>>>>> not
>>>>>>>>>>>>> take long.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I don't know if there are already docs, but that is another
>>>>>>> thing
>>>>>>>> we
>>>>>>>>>>> may
>>>>>>>>>>>>> want to get in with the merge.
>>>>>>>>>>>>> a minimal coverage at least on how to use the feature, and
>>>>>> maybe
>>>>>>>>>>> calling
>>>>>>>>>>>> it
>>>>>>>>>>>>> out as experimental?
>>>>>>>>>>>>> 
>>>>>>>>>>>>> my main concern were around incremental backups.
>>>>>>>>>>>>> I'm still not convinced around the fact that because the WALs
>>>>>>>>>> contain
>>>>>>>>>>>>> regions of multiple tables
>>>>>>>>>>>>> the incremental backup will keep around WALs with some data
>>>>>> that
>>>>>>>> we
>>>>>>>>>>> don't
>>>>>>>>>>>>> really want in the backup (for space or maybe security
>>>>>> reason).
>>>>>>>>>>>>> 
>>>>>>>>>>>>> then there was the question about for how long should I take
>>>>>>>>>>>> incrementals,
>>>>>>>>>>>>> before deciding that a fresh full backup is less costly in
>>>>>> terms
>>>>>>>> of
>>>>>>>>>>>> space.
>>>>>>>>>>>>> but I think this incremental merge/compaction was a feature on
>>>>>>> the
>>>>>>>>>>>> roadmap
>>>>>>>>>>>>> as Phase3.
>>>>>>>>>>>>> which I think is ok to get later on,
>>>>>>>>>>>>> maybe just call out a lifecycle example on the docs under
>>>>>> "best
>>>>>>>>>>>> practices".
>>>>>>>>>>>> 
>>>>>>>>>>>> I think this will depend on the use case, and other factors like
>>>>>>>>>>> bandwidth
>>>>>>>>>>>> available, how much data
>>>>>>>>>>>> the user is willing to lose in case of catastrophic failure and
>>>>>>> how
>>>>>>>>>>>> "expensive" is full backup versus
>>>>>>>>>>>> incremental one.
>>>>>>>>>>>> 
>>>>>>>>>>>> The full backup should also be useable by default, so maybe we
>>>>>> can
>>>>>>>>>> make
>>>>>>>>>>> an
>>>>>>>>>>>> option to not even keep WAL files, and completely disable
>>>>>>>> incremental
>>>>>>>>>>>> backups?
>>>>>>>>>>>> 
>>>>>>>>>>>> Enis
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> has anyone interested in using backups looked at the doc in
>>>>>>>>>> HBASE-7912?
>>>>>>>>>>>>> is the current design of incremental backup acceptable for
>>>>>>>> everyone
>>>>>>>>>>>> wanting
>>>>>>>>>>>>> to use this feature?
>>>>>>>>>>>>> (maybe this should be a question for the @user list and not
>>>>>> dev)
>>>>>>>>>>>>> 
>>>>>>>>>>>>> is there anyone already using this feature or it is just dev
>>>>>>>> testing
>>>>>>>>>>> it?
>>>>>>>>>>>>> to me will be interesting having a use-case/workflow example,
>>>>>>>>>>>>> to see if in the real world my concerns about incremental are
>>>>>>> not
>>>>>>>>>>> showing
>>>>>>>>>>>>> up.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> On Tue, Jul 19, 2016 at 1:35 PM, Ted Yu <yuzhih...@gmail.com
>>>>>>> <javascript:;>>
>>>>>>>>>> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Gentle ping on this subject.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> The changes are mostly non-intrusive.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> More comments are welcome.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 9:29 PM, Vladimir Rodionov <
>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;>
>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Not that hard, Andrew. I will open JIRA.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> -Vlad
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 8:46 PM, Andrew Purtell <
>>>>>>>>>>>>>> andrew.purt...@gmail.com <javascript:;>>
>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> How hard would it be to convert what you've been using
>>>>>> to
>>>>>>>> test
>>>>>>>>>>> end
>>>>>>>>>>>> to
>>>>>>>>>>>>>> end
>>>>>>>>>>>>>>>> during dev into an IT?
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> On Jul 11, 2016, at 5:31 PM, Vladimir Rodionov <
>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;>
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If
>>>>>> not,
>>>>>>>> any
>>>>>>>>>>> tips
>>>>>>>>>>>>> on a
>>>>>>>>>>>>>>>>>>> semi-automateable way to take backups and restore
>>>>>>> them?
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> We do not have yet, but we have a lot of unit tests.
>>>>>> We
>>>>>>>>>>> provide 2
>>>>>>>>>>>>> API
>>>>>>>>>>>>>>> for
>>>>>>>>>>>>>>>>> backup:
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 1. Admin.getBackupAdmin
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 2. Command - line via hbase command.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Everything is straightforward.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> -Vlad
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 5:23 PM, Dima Spivak <
>>>>>>>>>>>>> dspi...@cloudera.com <javascript:;>>
>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>> Is there an integration test in hbase-it yet? If not,
>>>>>>> any
>>>>>>>>>> tips
>>>>>>>>>>>> on
>>>>>>>>>>>>> a
>>>>>>>>>>>>>>>>>> semi-automateable way to take backups and restore
>>>>>> them?
>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>> -Dima
>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 6:42 PM, Vladimir Rodionov <
>>>>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;>
>>>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>> Sorry, wrong links:
>>>>>>>>>>>>>>>>>>> These are the phases:
>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>> Phase 1:
>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14030
>>>>>>>>> 14030
>>>>>>>>>>>>>>>>>>> Phase 2:
>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14123
>>>>>>>>> 14123
>>>>>>>>>>>>>>>>>>> Phase 3:
>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-14414
>>>>>>>>> 14414
>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>> -Vlad
>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 4:41 PM, Vladimir Rodionov <
>>>>>>>>>>>>>>>>>> vladrodio...@gmail.com <javascript:;>
>>>>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>> These are the phases:
>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>> Phase 1:
>>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912
>>>>>>>>> 14030
>>>>>>>>>>>>>>>>>>>> Phase 2:
>>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912
>>>>>>>>> 14123
>>>>>>>>>>>>>>>>>>>> Phase 3:
>>>>>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/HBASE-
>>>>>>>>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/HBASE-7912
>>>>>>>>> 14414
>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>> -Vlad
>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>> On Mon, Jul 11, 2016 at 12:21 PM, Enis Söztutar <
>>>>>>>>>>>>> e...@apache.org <javascript:;>>
>>>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> As you guys may already be familiar, Vladimir,
>>>>>> Ted,
>>>>>>>>>> Jerry
>>>>>>>>>>> and
>>>>>>>>>>>>>>> others
>>>>>>>>>>>>>>>>>>> have
>>>>>>>>>>>>>>>>>>>>> been developing the backup / restore functionality
>>>>>>> in
>>>>>>>> a
>>>>>>>>>>>> series
>>>>>>>>>>>>> of
>>>>>>>>>>>>>>>>>> issues
>>>>>>>>>>>>>>>>>>>>> committed in the separate branch HBASE-7912[1].
>>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> Backup / Restore functionality is tracked as a
>>>>>>> 4-phase
>>>>>>>>>>>> project,
>>>>>>>>>>>>>> and
>>>>>>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>>>>>>> first two phases are complete and useable. We are
>>>>>>> now
>>>>>>>>>>> working
>>>>>>>>>>>>> on
>>>>>>>>>>>>>>>>>> Phase 3
>>>>>>>>>>>>>>>>>>>>> items, which are mostly improvements. We think
>>>>>> that
>>>>>>>> the
>>>>>>>>>>>> current
>>>>>>>>>>>>>>> code
>>>>>>>>>>>>>>>>>> in
>>>>>>>>>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>>>>>>> branch containing all Phase 1 and Phase 2 items,
>>>>>> and
>>>>>>>>>> some
>>>>>>>>>>>>> Phase 3
>>>>>>>>>>>>>>>>>> items
>>>>>>>>>>>>>>>>>>> is
>>>>>>>>>>>>>>>>>>>>> useable on it's own, and we do not have to wait
>>>>>> for
>>>>>>>> all
>>>>>>>>>> the
>>>>>>>>>>>>>>>> subtickets
>>>>>>>>>>>>>>>>>>> to
>>>>>>>>>>>>>>>>>>>>> be finished to make it completely useable (as
>>>>>> follow
>>>>>>>> up
>>>>>>>>>>>> tickets
>>>>>>>>>>>>>> are
>>>>>>>>>>>>>>>>>>> mostly
>>>>>>>>>>>>>>>>>>>>> improvements or optimizations). The improvements
>>>>>> in
>>>>>>>> the
>>>>>>>>>>> works
>>>>>>>>>>>>> are
>>>>>>>>>>>>>>> all
>>>>>>>>>>>>>>>>>>>>> backwards compatible with the existing stuff.
>>>>>> Thus,
>>>>>>> we
>>>>>>>>>>> would
>>>>>>>>>>>>> like
>>>>>>>>>>>>>>> to
>>>>>>>>>>>>>>>>>>>>> propose that the branch HBASE-7912 be merged into
>>>>>>>>>> master.
>>>>>>>>>>>> The
>>>>>>>>>>>>>>> parent
>>>>>>>>>>>>>>>>>>> jira
>>>>>>>>>>>>>>>>>>>>> has a design doc that goes into details about the
>>>>>>>>>>>>> implementation
>>>>>>>>>>>>>>> and
>>>>>>>>>>>>>>>>>>>>> design
>>>>>>>>>>>>>>>>>>>>> choices in case you are interested[2].
>>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> Most of the changes are largely non-intrusive and
>>>>>>>>>> confined
>>>>>>>>>>> to
>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>>>>>>> backup subsystem.
>>>>>>>>>>>>>>>>>>>>> The unit tests have been passing on manual runs
>>>>>> and
>>>>>>> we
>>>>>>>>>>>>>>> (hortonworks)
>>>>>>>>>>>>>>>>>>> have
>>>>>>>>>>>>>>>>>>>>> been running the integration tests as well as some
>>>>>>>> other
>>>>>>>>>>>>>>> shell-based
>>>>>>>>>>>>>>>>>>>>> system
>>>>>>>>>>>>>>>>>>>>> tests on a forked version of the code. Most of the
>>>>>>>> work
>>>>>>>>>> has
>>>>>>>>>>>>> been
>>>>>>>>>>>>>>>>>>> reviewed
>>>>>>>>>>>>>>>>>>>>> by 1, 2 or 3 committers already (mostly Ted,
>>>>>> myself
>>>>>>>> and
>>>>>>>>>>>> Jerry).
>>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> What do you guys think? Is it time to call a vote?
>>>>>>> Any
>>>>>>>>>>>> concerns
>>>>>>>>>>>>>> or
>>>>>>>>>>>>>>>>>>>>> feedback
>>>>>>>>>>>>>>>>>>>>> appreciated.
>>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> [1]
>>>>>>> https://issues.apache.org/jira/browse/HBASE-7912
>>>>>>>>>>>>>>>>>>>>> [2]
>>>>>>> https://issues.apache.org/jira/secure/attachment/12816339/
>>>>>> HBaseBackupAndRestore%20-0.91.pdf
>>>>>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>>>>>> Enis
>>>>>> 
>>>>>> 
>>>>>> --
>>>>>> -Dima
>> 

Reply via email to