Hi All,

We have a strange issue with HBase performance (overall cluster
performance) in case one of datanodes in the cluster unexpectedly goes
down.

So scenario is like follows:
- Cluster works fine, it's stable.
- One DataNode unexpectedly goes down (PSU issue, network issue, anything)
- Whole HBase cluster goes down (performance becomes so bad that we have to
restart all RegionServers to get it back to life).

Most funny and latest issue that happened was that we added new node to the
cluster (having 8 x 4T SATA disks) and we left just DataNode running on it
to give it couple of days to get some data. At some point in time, due to
hardware issue, server rebooted (twice during three hours) in moment when
it had maybe 5% of data it would have in a couple of days. Nothing else
beside DataNode was running, and once it went down, it affected literary
everything, and restarting RegionServers in the end fixed it.

We are using HBase 0.98.0 with Hadoop 2.4.0

Reply via email to