Have you read '9.3.4. Better Mean Time to Recover (MTTR)' under http://hbase.apache.org/book.html#important_configurations ?
For #2, can you take a look at https://issues.apache.org/jira/browse/HBASE-10070 ?

Cheers

On Thu, Feb 19, 2015 at 12:46 AM, Serega Sheypak <[email protected]> wrote:

> Hi, we are running HBase on super-low-cost HW :)
> Sometimes random node goes down. And HBase needs a time to move regions
> from failed RS.
>
> What are the practices to:
> 1. minimize MTTR?
> 2. is there any possibility to gracefully handle situation when region is
> not accessible for r/w?
> I can just drop the data, but I can't wait seconds or minutes for response.
> I would like to break request after 100ms and return empty result.
>
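
On the 100 ms cutoff asked about in question 2: whatever is done on the server side, the caller itself can be time-boxed on the client. Below is a minimal sketch, assuming Java 9 or later. The class name `TimeBoxedRead`, the method `readOrEmpty`, and the slow supplier are hypothetical and only stand in for a real blocking read; no HBase client API is used here. Note that the timeout only unblocks the caller; the underlying read keeps running on the pool thread until it finishes or is interrupted.

```java
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

public class TimeBoxedRead {

    // Runs the blocking read on the given pool. Returns its result if it
    // completes within timeoutMs, otherwise an empty list. A read that
    // throws is also mapped to an empty list (the data is simply dropped).
    static <T> List<T> readOrEmpty(Supplier<List<T>> blockingRead,
                                   long timeoutMs,
                                   ExecutorService pool) {
        return CompletableFuture
                .supplyAsync(blockingRead, pool)
                .completeOnTimeout(Collections.<T>emptyList(), timeoutMs, TimeUnit.MILLISECONDS)
                .exceptionally(e -> Collections.<T>emptyList())
                .join();
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newCachedThreadPool();

        // Hypothetical slow read: stands in for a request against a region
        // that is still being moved off a failed RegionServer.
        Supplier<List<String>> slowRead = () -> {
            try {
                Thread.sleep(2000);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            return List.of("row1");
        };

        // The caller gets an answer after roughly 100 ms; prints []
        System.out.println(readOrEmpty(slowRead, 100, pool));

        // Interrupt the abandoned read so the JVM can exit.
        pool.shutdownNow();
    }
}
```

The pool should be sized with abandoned reads in mind: if many requests time out while a region is unavailable, their threads stay busy until the underlying call returns, so an unbounded pool can grow quickly.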
