[ 
https://issues.apache.org/jira/browse/HADOOP-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12560613#action_12560613
 ] 

stack commented on HADOOP-2662:
-------------------------------

In the master log, what do you see? Its waiting and waiting and then just gives 
up after lease timeout because it says the regionservers are taking too long to 
report in?

The code doesn't currently accomodate a regionserver that is taking its time 
going down because its busy running flushes and outstanding, etc.  It probably 
should (regionserver reports every so often to the master that its working on 
the close).

There should be no data loss in this case (Regionservers are going down 
properly -- its just the final report in to the master that its down that is 
failing)

> master stopping before regions finish stopping
> ----------------------------------------------
>
>                 Key: HADOOP-2662
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2662
>             Project: Hadoop
>          Issue Type: Bug
>          Components: contrib/hbase
>            Reporter: Billy Pearson
>            Priority: Minor
>             Fix For: 0.16.1
>
>
> I get this on my region servers logs sometimes when I shutdown the cluster it 
> repeats several times sometimes trying to find the master
> I thank we need to look at the master and make sure we do not stop the master 
> on exit before all region servers report down.
> I am not sure if there could be data loss or not but we should not leave 
> region servers looking for the master unless it has failed on it own.
> {code}
> 2008-01-18 17:40:42,009 WARN org.apache.hadoop.hbase.HRegionServer: Failed to 
> send exiting message to master:
> java.net.ConnectException: Connection refused
>         at java.net.PlainSocketImpl.socketConnect(Native Method)
>         at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>         at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>         at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>         at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>         at java.net.Socket.connect(Socket.java:519)
>         at 
> org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:159)
>         at org.apache.hadoop.ipc.Client.getConnection(Client.java:575)
>         at org.apache.hadoop.ipc.Client.call(Client.java:498)
>         at 
> org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:210)
>         at $Proxy0.regionServerReport(Unknown Source)
>         at org.apache.hadoop.hbase.HRegionServer.run(HRegionServer.java:898)
>         at java.lang.Thread.run(Thread.java:595)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to