[
https://issues.apache.org/jira/browse/HADOOP-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521525
]
Jim Kellerman commented on HADOOP-1527:
---------------------------------------
Even better, if the master discovers a stale entry in the root or meta regions,
it should go look to see if the log file exists an split it before assigning
the region to a new server.
This would even handle the case where the region server serving the root region
died because it is highly unlikely that a region server would have only been
serving the root region.
So the plan of attack is to add a check in the master upon discovery of a stale
entry in the root and meta regions, and to create a separate utility to recover
a region server log in the unlikely event that a region server was only serving
the root region.
> Region server won't start because logdir exists
> -----------------------------------------------
>
> Key: HADOOP-1527
> URL: https://issues.apache.org/jira/browse/HADOOP-1527
> Project: Hadoop
> Issue Type: Bug
> Components: contrib/hbase
> Reporter: stack
> Assignee: Jim Kellerman
>
> Starting and then impolitely stopping a cluster I came across the following:
> 2007-06-25 19:43:31,449 ERROR org.apache.hadoop.hbase.HRegionServer: Can not
> start region server because
> org.apache.hadoop.hbase.RegionServerRunningException: region server already
> running at 208.76.44.140:60010 because logdir exists
> at
> org.apache.hadoop.hbase.HRegionServer.<init>(HRegionServer.java:447)
> at
> org.apache.hadoop.hbase.HRegionServer.<init>(HRegionServer.java:372)
> at org.apache.hadoop.hbase.HRegionServer.main(HRegionServer.java:1233)
> Region server should recover or offer a recovery path when we run into this
> condition.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.