Re: [jira] Commented: (HADOOP-2394) Add support for migrating between hbase versions

stack Fri, 14 Dec 2007 14:31:09 -0800

Your scenario, moving from one type to another, should be easy enough to migrate; you'd just float both classes in the migration task and run the conversion from one type to the other. But I think you were intending to ask the harder question of going between versions of the same type. Taking your example of MapFiles, MapFiles are versioned and it looks like there's some attempt at making it so newer versions can read files written by older versions. I'd suggest that any hbase class that makes marks on the filesystem should be made to do likewise. HLog emissions and catalog tables, -ROOT- and .META., look like obvious candidates for versioning.
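
As a rough illustration of "newer versions can read files written by older versions", here is a minimal Python sketch. It is not hbase or MapFile code: the magic marker, the two layouts, and every function name are hypothetical, chosen only to show a version byte in the header selecting the matching reader.

```python
import io
import struct

MAGIC = b"HBV"        # hypothetical marker, not a real hbase or MapFile header
CURRENT_VERSION = 2   # hypothetical current on-disk format version


def write_record(stream, sequence_id, hbase_version):
    """Write the current (version 2) layout: sequence id plus a version string."""
    payload = hbase_version.encode("utf-8")
    header = struct.pack(">BqH", CURRENT_VERSION, sequence_id, len(payload))
    stream.write(MAGIC + header + payload)


def _read_v1(stream):
    # Assumed older layout: only the 8-byte sequence id was stored.
    (sequence_id,) = struct.unpack(">q", stream.read(8))
    return {"sequence_id": sequence_id, "hbase_version": None}


def _read_v2(stream):
    # Assumed newer layout: sequence id, then a length-prefixed version string.
    sequence_id, length = struct.unpack(">qH", stream.read(10))
    return {
        "sequence_id": sequence_id,
        "hbase_version": stream.read(length).decode("utf-8"),
    }


# One reader per format version the code still understands.
READERS = {1: _read_v1, 2: _read_v2}


def read_record(stream):
    """Read a record written by this or any older supported format version."""
    if stream.read(len(MAGIC)) != MAGIC:
        raise ValueError("not a versioned record")
    (version,) = struct.unpack(">B", stream.read(1))
    reader = READERS.get(version)
    if reader is None:
        raise ValueError(f"unsupported format version {version}")
    return reader(stream)
```

Writers only ever emit the newest layout, while the reader table keeps one entry per older layout, so retiring an old format is a matter of deleting its entry once no such files remain.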


St.Ack



Bryan Duxbury wrote:

The scheme you propose would be good so long as we only ever do things like rename files and move them around. If we ever decide to change something significant, like the underlying file structure (like if we break away from using MapFile or something), then we'd need the ability to read the old version as well as write the new ones. What would you like to be able to do in these instances?
On Dec 14, 2007, at 12:46 PM, stack (JIRA) wrote:
[https://issues.apache.org/jira/browse/HADOOP-2394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12551938]
stack commented on HADOOP-2394:
-------------------------------
I like the rails idea. Migration should support going in both directions I'd say.
hbase state is all kept out in the filesystem so hopefully, filesystem machinations should be all that's required for making migrations.
HStoreFiles are MapFiles + an info file stored in a sympathetic directory. This info file has little in it currently -- just sequence id. Could also have hbase version. For log files, perhaps first record is stamp of the hbase version doing the writing.
It occurred to me that migrations could entail significant rewriting of on-filesystem data. To distribute the migration, we could have the master and regionservers run the migrations. Each server on startup would look for any migrations to run and just run them if any found. Nice thing about this is that we'd get the migration job distributed. But thinking on it, probably better to have the migration done outside of hbase in its own dedicated MR job. Would be easier tracking failures and running reversals.
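
The rails-style, two-directional migration idea in the comment above could look roughly like the following Python sketch. Everything in it is an assumption made for illustration: the layout is modelled as an in-memory dict rather than a real filesystem, and the `Migration` class, the two example steps, and the paths are hypothetical, not anything that exists in hbase.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Layout = Dict[str, str]  # stand-in for on-filesystem state: path -> contents


@dataclass(frozen=True)
class Migration:
    version: int                    # layout version this step produces
    up: Callable[[Layout], None]    # move from version - 1 to version
    down: Callable[[Layout], None]  # undo the step


def _move_info_up(layout: Layout) -> None:
    layout["region/info"] = layout.pop("info")


def _move_info_down(layout: Layout) -> None:
    layout["info"] = layout.pop("region/info")


def _stamp_version_up(layout: Layout) -> None:
    layout["region/version"] = "2"


def _stamp_version_down(layout: Layout) -> None:
    del layout["region/version"]


# Hypothetical steps, ordered by the version they produce.
MIGRATIONS: List[Migration] = [
    Migration(1, _move_info_up, _move_info_down),
    Migration(2, _stamp_version_up, _stamp_version_down),
]


def migrate(layout: Layout, current: int, target: int) -> int:
    """Move the layout from version `current` to `target`, in either direction."""
    if current < target:
        for step in MIGRATIONS:
            if current < step.version <= target:
                step.up(layout)
    else:
        for step in reversed(MIGRATIONS):
            if target < step.version <= current:
                step.down(layout)
    return target
```

Because each step carries its own reversal, rolling back is the same loop run in reverse order, which is the property that makes "running reversals" tractable whether the steps run inside the servers or in a separate job.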
Add support for migrating between hbase versions
-----------------------------------------------

                Key: HADOOP-2394
                URL: https://issues.apache.org/jira/browse/HADOOP-2394
            Project: Hadoop
         Issue Type: Improvement
         Components: contrib/hbase
           Reporter: Johan Oskarsson
If Hbase is to be used to serve data to live systems we would need a way to upgrade both the underlying hadoop installation and hbase to newer versions with minimal downtime.
