On Fri, Dec 21, 2012 at 12:02 PM, Andrew Purtell <apurt...@apache.org> wrote: > One of our large tests -- TestReplication -- is going to be moved out into > an integration test, and this will want to configure two independent > clusters with client access to both. Assume two 3 slave clusters, that's a > total of 8. Otherwise for single cluster tests 5 slaves seems a good > starting point IMO.
This will require a very particular Cluster deployment profile (instead of one cluster -- you actually need 2 clusters) -- something that we don't currently do, but would welcome contributions on how to make it happen. >> what are you expectations for tests that do manipulate the state of the > cluster (like ChaosMonkey) -- do you expect unrestricted ssh, etc? > > I think we want, at least, unrestricted access on the service accounts for > hdfs, hbase, zookeeper, so we can kill -9 or kill -STOP processes at > different system layers. So how does it work? I start a test on a node that has a role of Hadoo/HBase client and the test excepts that ssh'ing into the RSses as a root user would suffice? How does it kill thing? How does it restart things? Remember all of our deployments happen from packages -- are the tests aware of init.d scripts and such? Thanks, Roman.