The scheme you propose would be good so long as we only ever do
things like rename files and move them around. If we ever decide to
change something significant, like the underlying file structure
(like if we break away from using MapFile or something), then we'd
need the ability to read the old version as well as write the new
ones. What would you like to be able to do in these instances?
On Dec 14, 2007, at 12:46 PM, stack (JIRA) wrote:
[ https://issues.apache.org/jira/browse/HADOOP-2394?
page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
tabpanel#action_12551938 ]
stack commented on HADOOP-2394:
-------------------------------
I like the rails idea. Migration should support going in both
directions I'd say.
hbase state is all kept out in the filesystem so hopefully,
filesystem machinations should be all thats required making
migrations.
HStoreFiles are MapFiles + an info file stored in a sympathetic
directory. This info file has little in it currently -- just
sequence id. Could also have hbase version. For log files,
perhaps first record is stamp of the hbase version doing the writing.
It occurred to me that migrations could entail significant
rewriting of on-filesystem data. To distribute the migration, we
could we could have the master and regionservers run the
migrations. Each server on startup would look for any migrations
to run and just run them if any found. Nice thing about this is
that we'd get the migration job distributed. But thinking on it,
probably better to have the migration done outside of hbase in its
own dedicated MR job. Would be easier tracking failures and
running reversals.
Add supprt for migrating between hbase versions
-----------------------------------------------
Key: HADOOP-2394
URL: https://issues.apache.org/jira/browse/
HADOOP-2394
Project: Hadoop
Issue Type: Improvement
Components: contrib/hbase
Reporter: Johan Oskarsson
If Hbase is to be used to serve data to live systems we would need
a way to upgrade both the underlying hadoop installation and hbase
to newer versions with minimal downtime.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.