[
https://issues.apache.org/jira/browse/HBASE-854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12627096#action_12627096
]
Jim Kellerman commented on HBASE-854:
-------------------------------------
This is very strange. The failures occur only in the second test case of a
JUnit test in which we spin up multiple HBase and Hadoop clusters, and even
then not every time, which is the baffling part. The following exception causes
the region server to abort, and because the client application then cannot
connect, the test times out.
{code}
[junit] 2008-08-26 21:24:50,536 FATAL [RegionServer:0] regionserver.HRegionServer(424): Unhandled exception. Aborting...
[junit] java.io.IOException: Call failed on local exception
[junit] at org.apache.hadoop.ipc.Client.call(Client.java:718)
[junit] at org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:230)
[junit] at $Proxy8.getProtocolVersion(Unknown Source)
[junit] at org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:340)
[junit] at org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:327)
[junit] at org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:364)
[junit] at org.apache.hadoop.hbase.ipc.HbaseRPC.waitForProxy(HbaseRPC.java:302)
[junit] at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:706)
[junit] at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:291)
[junit] at java.lang.Thread.run(Thread.java:619)
[junit] Caused by: java.io.EOFException
[junit] at java.io.DataInputStream.readInt(DataInputStream.java:375)
[junit] at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:499)
[junit] at org.apache.hadoop.ipc.Client$Connection.run(Client.java:441)
[junit] 2008-08-26 21:24:50,538 DEBUG [RegionServer:0] hbase.RegionHistorian(316): Offlined
[junit] Exception in thread "RegionServer:0" java.lang.NullPointerException
[junit] at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:450)
[junit] at java.lang.Thread.run(Thread.java:619)
{code}
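For context on the "Caused by" frame: DataInputStream.readInt() throws
EOFException when the stream ends before four bytes have been read, so the
trace is consistent with the other end closing the connection before any
response arrived. A minimal standalone sketch of that behaviour follows. The
class and method names are made up for illustration, and the byte array merely
stands in for a socket; this is not the Hadoop ipc.Client code.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.UncheckedIOException;

class ReadIntEofSketch {
    /**
     * Returns true if readInt() reaches end of stream before it has read
     * the four bytes it needs. The byte array stands in for whatever the
     * peer managed to send before the connection closed.
     */
    static boolean hitsEof(byte[] bytesFromPeer) {
        DataInputStream in =
            new DataInputStream(new ByteArrayInputStream(bytesFromPeer));
        try {
            in.readInt();
            return false;
        } catch (EOFException e) {
            return true;
        } catch (IOException e) {
            // Not expected for an in-memory stream; rethrow unchecked.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Peer closed without sending anything.
        System.out.println(hitsEof(new byte[0]));
        // Peer sent a complete four-byte int.
        System.out.println(hitsEof(new byte[] {0, 0, 0, 7}));
    }
}
```

The first call reports an EOF and the second does not. This says nothing about
why the connection was closed, only that the reader saw it close early.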
Now it could be related to the fact that the client cache is static in HbaseRPC:
{code}
private static ClientCache CLIENTS=new ClientCache();
{code}
But if that were the cause, why don't all tests that have multiple test cases
fail every time on the second cluster spin-up?
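To make the hypothesis concrete, here is a minimal sketch of how a JVM-wide
static cache could hand a second test case a client left over from the first.
Everything in it is hypothetical: FakeClient and FakeClientCache are stand-ins,
not the real ipc.Client or ClientCache, and the sketch assumes the cache never
evicts a stopped client, which I have not verified against the real code.

```java
import java.util.HashMap;
import java.util.Map;

class StaticCacheSketch {
    /** Hypothetical stand-in for an RPC client; not org.apache.hadoop.ipc.Client. */
    static class FakeClient {
        private boolean stopped;
        void stop() { stopped = true; }
        boolean isStopped() { return stopped; }
    }

    /** Hypothetical cache that never evicts, keyed by an arbitrary string. */
    static class FakeClientCache {
        private final Map<String, FakeClient> clients = new HashMap<>();

        synchronized FakeClient getClient(String key) {
            return clients.computeIfAbsent(key, k -> new FakeClient());
        }
    }

    // Static, so it outlives any single test case run in the same JVM.
    private static final FakeClientCache CLIENTS = new FakeClientCache();

    /**
     * Simulates one test case: fetch a client, note whether it was already
     * stopped, then stop it as a cluster shutdown would.
     */
    static boolean testCaseSeesStoppedClient(String key) {
        FakeClient client = CLIENTS.getClient(key);
        boolean alreadyStopped = client.isStopped();
        client.stop();
        return alreadyStopped;
    }

    public static void main(String[] args) {
        System.out.println("first:  " + testCaseSeesStoppedClient("demo"));
        System.out.println("second: " + testCaseSeesStoppedClient("demo"));
    }
}
```

In this sketch the first test case gets a fresh client and the second gets the
stopped one, every time. That determinism is exactly what we do not observe,
so a static cache on its own would not account for the intermittent failures.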
> hbase-841 broke build on hudson?
> --------------------------------
>
> Key: HBASE-854
> URL: https://issues.apache.org/jira/browse/HBASE-854
> Project: Hadoop HBase
> Issue Type: Bug
> Reporter: stack
> Assignee: Jim Kellerman
> Priority: Blocker
> Fix For: 0.18.0
>
> Attachments: hbase-854.patch
>
>
> Jim, you want to take a look at it?
> 841 changed interfaces. Changed interfaces can make for odd issues like the
> hangs exhibited up on hudson (stuff has been failing with timeouts for me on
> my laptop since about the 841 commit. I don't have the same issue on branch).