Second RS goes down because "Bad connect ack with firstBadLink as..."
---------------------------------------------------------------------
Key: HBASE-1973
URL: https://issues.apache.org/jira/browse/HBASE-1973
Project: Hadoop HBase
Issue Type: Bug
Reporter: stack
Placeholder for issue where a second RS goes down when I kill another RS and
its DN. I asked over on hdfs-user why we're stuck on 140. Why can't it move
on to another DN creating a block.
{code}
2009-11-12 05:09:38,624 [regionserver/208.76.44.141:60020.logRoller] INFO
org.apache.hadoop.hbase.regionserver.wal.HLog: Roll
/hbase/.logs/aa0-000-14.u.powerset.com,60020,1258002404935/hlog.dat.1258002405346,
entries=52747, calcsize=63757973, filesize=58588977. New hlog
/hbase/.logs/aa0-000-14.u.powerset.com,60020,12
58002404935/hlog.dat.1258002578617
2009-11-12 05:09:38,628 [regionserver/208.76.44.141:60020.logRoller] DEBUG
org.apache.hadoop.hbase.regionserver.wal.HLog: Found 0 hlogs to remove out of
total 1; oldest outstanding seqnum is 1 from region .META.,,1
2009-11-12 05:09:38,628 [IPC Server handler 16 on 60020] INFO
org.apache.hadoop.hbase.regionserver.wal.HLog: edit=0,
write=TestTable/TestTable,,1258002466308/612228
2009-11-12 05:09:38,630 [regionserver/208.76.44.141:60020.logSyncer] INFO
org.apache.hadoop.hbase.regionserver.wal.HLog: sync
2009-11-12 05:09:38,646 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Exception in createBlockOutputStream java.io.IOException: Bad connect ack with
firstBadLink as 208.76.44.140:51010
2009-11-12 05:09:38,646 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Abandoning block blk_1786802275426334496_1076
2009-11-12 05:09:44,662 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Exception in createBlockOutputStream java.io.IOException: Bad connect ack with
firstBadLink as 208.76.44.140:51010
2009-11-12 05:09:44,662 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Abandoning block blk_5838454320666652488_1076
2009-11-12 05:09:46,226 [pool-1-thread-1] DEBUG
org.apache.hadoop.hbase.io.hfile.LruBlockCache: Cache Stats: Sizes:
Total=5.5656967MB (5836056), Free=672.77185MB (705452392), Max=678.3375MB
(711288448), Counts: Blocks=0, Access=0, Hit=0, Miss=0, Evictions=0, Evicted=0,
Ratios: Hit Ratio=NaN%, Miss Ratio=NaN%, Evicted
/Run=NaN
2009-11-12 05:09:50,683 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Exception in createBlockOutputStream java.io.IOException: Bad connect ack with
firstBadLink as 208.76.44.140:51010
2009-11-12 05:09:50,684 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Abandoning block blk_8615264034947520229_1076
2009-11-12 05:09:56,691 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Exception in createBlockOutputStream java.io.IOException: Bad connect ack with
firstBadLink as 208.76.44.140:51010
2009-11-12 05:09:56,692 [Thread-61] INFO org.apache.hadoop.hdfs.DFSClient:
Abandoning block blk_-5632723632070749366_1077
2009-11-12 05:10:02,696 [Thread-61] WARN org.apache.hadoop.hdfs.DFSClient:
DataStreamer Exception: java.io.IOException: Unable to create new block.
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSClient.java:3100)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2681)
2009-11-12 05:10:02,697 [Thread-61] WARN org.apache.hadoop.hdfs.DFSClient:
Could not get block locations. Source file
"/hbase/.logs/aa0-000-14.u.powerset.com,60020,1258002404935/hlog.dat.1258002578617"
- Aborting...
2009-11-12 05:10:02,699 [regionserver/208.76.44.141:60020.logSyncer] FATAL
org.apache.hadoop.hbase.regionserver.wal.HLog: Could not append. Requesting
close of hlog
java.io.IOException: Bad connect ack with firstBadLink as 208.76.44.140:51010
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.createBlockOutputStream(DFSClient.java:3160)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSClient.java:3080)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2681)
2009-11-12 05:10:02,701 [regionserver/208.76.44.141:60020.logSyncer] ERROR
org.apache.hadoop.hbase.regionserver.wal.HLog: Error while syncing, requesting
close of hlog
java.io.IOException: Bad connect ack with firstBadLink as 208.76.44.140:51010
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.createBlockOutputStream(DFSClient.java:3160)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSClient.java:3080)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2681)
2009-11-12 05:10:02,703 [IPC Server handler 20 on 60020] FATAL
org.apache.hadoop.hbase.regionserver.wal.HLog: Could not append. Requesting
close of hlog
java.io.IOException: IOException flush:java.io.IOException: Bad connect ack
with firstBadLink as 208.76.44.140:51010
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.hflush(DFSClient.java:3527)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.sync(DFSClient.java:3473)
at
org.apache.hadoop.fs.FSDataOutputStream.sync(FSDataOutputStream.java:97)
at org.apache.hadoop.hbase.regionserver.wal.HLog.hflush(HLog.java:829)
at
org.apache.hadoop.hbase.regionserver.wal.HLog$LogSyncer.run(HLog.java:751)
2009-11-12 05:10:02,703 [regionserver/208.76.44.141:60020.logSyncer] INFO
org.apache.hadoop.hbase.regionserver.wal.HLog:
regionserver/208.76.44.141:60020.logSyncer exiting
2009-11-12 05:10:02,706 [IPC Server handler 10 on 60020] FATAL
org.apache.hadoop.hbase.regionserver.wal.HLog: Could not append. Requesting
close of hlog
java.io.IOException: IOException flush:java.io.IOException: Bad connect ack
with firstBadLink as 208.76.44.140:51010
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.hflush(DFSClient.java:3527)
at
org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.sync(DFSClient.java:3473)
at
org.apache.hadoop.fs.FSDataOutputStream.sync(FSDataOutputStream.java:97)
at org.apache.hadoop.hbase.regionserver.wal.HLog.hflush(HLog.java:829)
at
org.apache.hadoop.hbase.regionserver.wal.HLog$LogSyncer.run(HLog.java:751)
{code}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.