Hi,

We are observing very high CPU load(400 to 600%) in one of our RegionServer to 
the point where the machine is becoming unresponsive.

At this point the whole cluster of 20+ RegionServers becoming unresponsive.

Before cluster becomes unresponsive we observed following symptoms:

  *   Huge bandwidth spike
  *   CPU spikes vertically form normal load to very high usage only in one 
RegionServer
  *   Few times even though machine is unresponsive, it sending heartbeats to 
master
  *   There is no spike in number of requests to HBase
  *   We are observed this pattern at least twice is last week
  *   We don't have any co-processors in any of the region servers

What could be the possible reasons for this kind of behaviour.

We are using hbase-0.98.7, hadoop-2.5.1 versions.
Its production cluster so upgrading to latest version will not be possible 
right away.


Thanks,
Sandeep.



Thanks,
Sandeep.

Reply via email to