Virag Kothari created HBASE-12450:
-------------------------------------

             Summary: Unbalance chaos monkey might kill all region servers without starting them back
                 Key: HBASE-12450
                 URL: https://issues.apache.org/jira/browse/HBASE-12450
             Project: HBase
          Issue Type: Bug
            Reporter: Virag Kothari
            Assignee: Virag Kothari
            Priority: Minor

UnbalanceKillAndRebalanceAction kills region servers, runs a balance, and then starts the killed region servers again. But if the balance fails, an exception is thrown and the start step is skipped, so the killed region servers are never started. For me, the balance kept failing with a socket timeout (default 1 min), because the master runs one iteration of the balancer for 5 mins (default config). Eventually all region servers are killed but never started back.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
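
The failure mode reported above can be illustrated with a small, self-contained sketch. It is not the HBase code or the patch for this issue: the Cluster interface, killBalanceRestart and FakeCluster are hypothetical names made up for the example. It shows only one possible way to keep the restart step from being skipped, by putting it in a finally block so it runs even when the balance call throws.

```java
import java.io.IOException;
import java.net.SocketTimeoutException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class UnbalanceSketch {

  /** Hypothetical stand-in for the cluster operations; not an HBase API. */
  interface Cluster {
    void killServer(String server) throws IOException;
    void balance() throws IOException; // may fail, e.g. with a socket timeout
    void startServer(String server) throws IOException;
  }

  /** Kills the victims, attempts a balance, and restarts the victims even if the balance fails. */
  static void killBalanceRestart(Cluster cluster, List<String> victims) throws IOException {
    List<String> killed = new ArrayList<>();
    try {
      for (String server : victims) {
        cluster.killServer(server);
        killed.add(server); // remember only the servers that were actually killed
      }
      cluster.balance();
    } finally {
      // Runs whether or not balance() threw, so killed servers are not left down.
      for (String server : killed) {
        try {
          cluster.startServer(server);
        } catch (IOException e) {
          // Keep going so one failed start does not block the others.
          System.err.println("Could not restart " + server + ": " + e.getMessage());
        }
      }
    }
  }

  /** In-memory fake whose balance always times out, mimicking the situation in the report. */
  static class FakeCluster implements Cluster {
    final Set<String> live = new LinkedHashSet<>();

    FakeCluster(List<String> servers) {
      live.addAll(servers);
    }

    @Override
    public void killServer(String server) {
      live.remove(server);
    }

    @Override
    public void balance() throws IOException {
      throw new SocketTimeoutException("balance timed out");
    }

    @Override
    public void startServer(String server) {
      live.add(server);
    }
  }

  public static void main(String[] args) {
    FakeCluster cluster = new FakeCluster(Arrays.asList("rs1", "rs2", "rs3"));
    try {
      killBalanceRestart(cluster, Arrays.asList("rs1", "rs2"));
    } catch (IOException e) {
      System.out.println("balance failed: " + e.getMessage());
    }
    System.out.println("live servers after action: " + cluster.live.size());
  }
}
```

In this sketch the balance exception still propagates to the caller, so the failure stays visible, but all three fake servers are live again afterwards. Whether the real action should rethrow or only log the balance failure is a separate design choice that this example does not settle.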