Dima Spivak created HBASE-12818:
-----------------------------------
Summary: TestSnapshotFromMaster#testSnapshotHFileArchiving is
failing occasionally
Key: HBASE-12818
URL: https://issues.apache.org/jira/browse/HBASE-12818
Project: HBase
Issue Type: Bug
Components: test
Affects Versions: 1.1.0
Reporter: Dima Spivak
Starting in late December, our internal runs on branch-1 have failed every few
days with the following:
{code}
java.lang.AssertionError: Archived hfiles [] is missing snapshot
file:c7ed21029d4e4cb28fdd138a28c8f3e7
at org.junit.Assert.fail(Assert.java:88)
at org.junit.Assert.assertTrue(Assert.java:41)
at
org.apache.hadoop.hbase.master.cleaner.TestSnapshotFromMaster.testSnapshotHFileArchiving(TestSnapshotFromMaster.java:347)
{code}
A full log of the stdout from one of these runs can be found [on
Gist|https://gist.github.com/dimaspivak/1c27a5d95e26c4bedc4f]; the relevant
portion seems to be
{code}
2015-01-07 05:04:06,015 DEBUG [B.defaultRpcServer.handler=3,queue=0,port=55325]
util.FSTableDescriptors(177): Exception during readTableDecriptor. Current
table name = test
org.apache.hadoop.hbase.TableInfoMissingException: No table descriptor file
under
hdfs://localhost:42974/user/jenkins/test-data/ce542e14-a8e7-4764-8a68-8fb6401ebeb8/data/default/test
at
org.apache.hadoop.hbase.util.FSTableDescriptors.getTableDescriptorFromFs(FSTableDescriptors.java:509)
at
org.apache.hadoop.hbase.util.FSTableDescriptors.getTableDescriptorFromFs(FSTableDescriptors.java:487)
at
org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:172)
at
org.apache.hadoop.hbase.master.HMaster.listTableDescriptors(HMaster.java:2165)
at
org.apache.hadoop.hbase.master.MasterRpcServices.getTableDescriptors(MasterRpcServices.java:787)
at
org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:42402)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2028)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:108)
at
org.apache.hadoop.hbase.ipc.RpcExecutor.consumerLoop(RpcExecutor.java:130)
at org.apache.hadoop.hbase.ipc.RpcExecutor$1.run(RpcExecutor.java:107)
at java.lang.Thread.run(Thread.java:744)
{code}
I've yet to see this on upstream runs, but I'll keep an eye out. Also, note
that this is unrelated to HBASE-9072, which had the same test failing for other
reasons back in 2013.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)