[
https://issues.apache.org/jira/browse/HBASE-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652856#action_12652856
]
stack commented on HBASE-927:
-----------------------------
This patch appears to have broken trunk; Hudson fails. If I revert my trunk to
r722704, the revision before HBASE-1042, the build fails in tests such as
org.apache.hadoop.hbase.TestGlobalMemcacheLimit with the following:
{code}
2008-12-03 08:57:01,369 DEBUG [HMaster] master.HMaster(421): Main processing loop: PendingOpenOperation from 127.0.0.1:39337
2008-12-03 08:57:01,371 INFO [HMaster] master.ProcessRegionOpen$1(71): .META.,,1 open on 127.0.0.1:39337
2008-12-03 08:57:01,372 INFO [HMaster] master.ProcessRegionOpen$1(82): updating row .META.,,1 in region -ROOT-,,0 with startcode 1228323417464 and server 127.0.0.1:39337
2008-12-03 08:57:03,185 DEBUG [main] client.HConnectionManager$TableServers(792): Found ROOT REGION => {NAME => '-ROOT-,,0', STARTKEY => '', ENDKEY => '', ENCODED => 70236052, TABLE => {{NAME => '-ROOT-', IS_ROOT => 'true', IS_META => 'true', FAMILIES => {NAME => 'info', BLOOMFILTER => 'false', COMPRESSION => 'NONE', VERSIONS => '10', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false', BLOCKCACHE => 'false'}]}}
2008-12-03 08:57:03,225 ERROR [main] hbase.HBaseClusterTestCase(130): Exception in setup!
org.apache.hadoop.hbase.master.NotAllMetaRegionsOnlineException: org.apache.hadoop.hbase.master.NotAllMetaRegionsOnlineException
    at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:596)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.hbase.ipc.HbaseRPC$Server.call(HbaseRPC.java:634)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:892)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
    at org.apache.hadoop.hbase.RemoteExceptionHandler.decodeRemoteException(RemoteExceptionHandler.java:82)
    at org.apache.hadoop.hbase.client.HBaseAdmin.createTableAsync(HBaseAdmin.java:195)
    at org.apache.hadoop.hbase.client.HBaseAdmin.createTable(HBaseAdmin.java:153)
    at org.apache.hadoop.hbase.TestGlobalMemcacheLimit.postHBaseClusterSetup(TestGlobalMemcacheLimit.java:70)
    at org.apache.hadoop.hbase.HBaseClusterTestCase.setUp(HBaseClusterTestCase.java:128)
    at junit.framework.TestCase.runBare(TestCase.java:125)
    at junit.framework.TestResult$1.protect(TestResult.java:106)
    at junit.framework.TestResult.runProtected(TestResult.java:124)
    at junit.framework.TestResult.run(TestResult.java:109)
    at junit.framework.TestCase.run(TestCase.java:118)
    at junit.framework.TestSuite.runTest(TestSuite.java:208)
    at junit.framework.TestSuite.run(TestSuite.java:203)
    at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.run(JUnitTestRunner.java:421)
    at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.launch(JUnitTestRunner.java:912)
    at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.main(JUnitTestRunner.java:766)
2008-12-03 08:57:03,226 DEBUG [main] hbase.LocalHBaseCluster(254): Shutting down HBase Cluster
{code}
> We don't recover if HRS hosting -ROOT-/.META. goes down
> -------------------------------------------------------
>
> Key: HBASE-927
> URL: https://issues.apache.org/jira/browse/HBASE-927
> Project: Hadoop HBase
> Issue Type: Bug
> Reporter: stack
> Assignee: Jim Kellerman
> Priority: Blocker
> Fix For: 0.19.0
>
>
> To replicate, set up a cluster with a master and a regionserver. Start up
> the cluster. Kill the regionserver. The master then just does this over and over:
> {code}
> ...
> 2008-10-14 18:54:14,737 INFO org.apache.hadoop.hbase.master.BaseScanner: RegionManager.metaScanner scanning meta region {regionname: .META.,,1, startKey: <>, server: XX.XX.XX.XX:60020}
> 2008-10-14 18:54:15,739 INFO org.apache.hadoop.ipc.Client: Retrying connect to server:XX.XX.XX.XX:60020. Already tried 0 time(s).
> 2008-10-14 18:54:16,742 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: XX.XX.XX.XX:60020. Already tried 1 time(s).
> 2008-10-14 18:54:17,744 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: XX.XX.XX.XX:60020. Already tried 2 time(s).
> 2008-10-14 18:54:18,747 INFO org.apache.hadoop.ipc.Client: Retrying connect to server:XX.XX.XX.XX:60020. Already tried 3 time(s).
> 2008-10-14 18:54:19,749 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: XX.XX.XX.XX:60020. Already tried 4 time(s).
> 2008-10-14 18:54:20,752 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: XX.XX.XX.XX:60020. Already tried 5 time(s).
> 2008-10-14 18:54:21,755 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: XX.XX.XX.XX:60020. Already tried 6 time(s).
> 2008-10-14 18:54:22,757 INFO org.apache.hadoop.ipc.Client: Retrying connect to server:XX.XX.XX.XX:60020. Already tried 7 time(s).
> 2008-10-14 18:54:23,759 INFO org.apache.hadoop.ipc.Client: Retrying connect to server:XX.XX.XX.XX:60020. Already tried 8 time(s).
> 2008-10-14 18:54:24,762 INFO org.apache.hadoop.ipc.Client: Retrying connect to server:XX.XX.XX.XX:60020. Already tried 9 time(s).
> 2008-10-14 18:54:24,763 WARN org.apache.hadoop.hbase.master.BaseScanner: Scan one META region: {regionname: .META.,,1, startKey: <>, server: XX.XX.XX.XX:60020}
> java.io.IOException: Call failed on local exception
>     at org.apache.hadoop.ipc.Client.call(Client.java:718)
>     at org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:245)
>     at $Proxy2.openScanner(Unknown Source)
>     at org.apache.hadoop.hbase.master.BaseScanner.scanRegion(BaseScanner.java:159)
>     at org.apache.hadoop.hbase.master.MetaScanner.scanOneMetaRegion(MetaScanner.java:74)
>     at org.apache.hadoop.hbase.master.MetaScanner.maintenanceScan(MetaScanner.java:129)
>     at org.apache.hadoop.hbase.master.BaseScanner.chore(BaseScanner.java:139)
>     at org.apache.hadoop.hbase.Chore.run(Chore.java:62)
> Caused by: java.net.ConnectException: Connection refused
>     at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
>     at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
>     at sun.nio.ch.SocketAdaptor.connect(SocketAdaptor.java:118)
>     at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:300)
>     at org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:177)
>     at org.apache.hadoop.ipc.Client.getConnection(Client.java:789)
>     at org.apache.hadoop.ipc.Client.call(Client.java:704)
>     ... 7 more
> 2008-10-14 18:54:24,766 INFO org.apache.hadoop.hbase.master.BaseScanner: all meta regions scanned
> ...
> {code}
> Made it a blocker.
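
For readers skimming the log in the description, the pattern it shows can be sketched roughly as follows. This is not HBase or Hadoop code: MetaServer, scanWithRetries and chorePass are hypothetical names, and the limit of ten attempts is only read off the "Already tried 0..9 time(s)" lines. The point is that each periodic pass retries the dead server, logs the failure, and finishes without any recovery step, so the next pass repeats the same thing.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.util.ArrayList;
import java.util.List;

public class MetaScanRetrySketch {
    /** Hypothetical stand-in for an RPC to the regionserver hosting .META. */
    interface MetaServer {
        void openScanner() throws IOException;
    }

    /** Bounded retry: try maxAttempts times, then give up and wrap the last error. */
    static void scanWithRetries(MetaServer server, int maxAttempts, List<String> log)
            throws IOException {
        IOException last = null;
        for (int tried = 0; tried < maxAttempts; tried++) {
            try {
                server.openScanner();
                return; // scanner opened, nothing more to do
            } catch (IOException e) {
                log.add("Retrying connect to server. Already tried " + tried + " time(s).");
                last = e;
            }
        }
        throw new IOException("Call failed on local exception", last);
    }

    /** One periodic pass: the failure is logged, but no recovery action is taken. */
    static void chorePass(MetaServer server, List<String> log) {
        try {
            scanWithRetries(server, 10, log); // 10 is an assumption taken from the log
        } catch (IOException e) {
            log.add("WARN scan failed: " + e.getMessage());
        }
        log.add("all meta regions scanned");
    }

    public static void main(String[] args) {
        // A server that always refuses connections, like the killed regionserver.
        MetaServer dead = () -> { throw new ConnectException("Connection refused"); };
        List<String> log = new ArrayList<>();
        for (int pass = 0; pass < 2; pass++) {
            chorePass(dead, log);
        }
        log.forEach(System.out::println);
    }
}
```

Running main prints the same block of lines once per pass, which mirrors the "over and over" behaviour reported above; what a recovery step should do instead is left open here.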
--
This message is automatically generated by JIRA.