Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
Hello, I'm trying to swap out a standby NameNode in a QJM / HA configuration. I believe the steps to achieve this would be something similar to: Use the Bootstrap standby command to prep the replacment standby. Or rsync if the command fails. Somehow update the datanodes, so they push the heartbe

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Jing Zhao
Hi Colin, I guess currently we may have to restart almost all the daemons/services in order to swap out a standby NameNode (SBN): 1. The current active NameNode (ANN) needs to know the new SBN since in the current implementation the SBN tries to send rollEditLog RPC request to ANN periodicall

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
Hi Jing, Thanks for the response. I will try this out, and file an Apache jira. Best, Colin Williams On Thu, Jul 31, 2014 at 11:19 AM, Jing Zhao wrote: > Hi Colin, > > I guess currently we may have to restart almost all the > daemons/services in order to swap out a standby NameNode (SBN)

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Bryan Beaudreault
We've done this a number of times without issue. Here's the general flow: 1) Shutdown namenode and zkfc on SNN 2) Stop zkfc on ANN (ANN will remain active because there is no other zkfc instance running to fail over to) 3) Run hdfs zkfc -formatZK on ANN 4) Start zkfc on ANN (will sync up with AN

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
Thanks! I will give this a shot. On Thu, Jul 31, 2014 at 1:12 PM, Bryan Beaudreault wrote: > We've done this a number of times without issue. Here's the general flow: > > 1) Shutdown namenode and zkfc on SNN > 2) Stop zkfc on ANN (ANN will remain active because there is no other > zkfc insta

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
On 3) Run hdfs zkfc -formatZK in my test environment, I get a Warning then an error WARNING: Before proceeding, ensure that all HDFS services and failover controllers are stopped! the complete output: sudo hdfs zkfc -formatZK 2014-07-31 17:43:07,952 INFO [main] tools.DFSZKFailoverController (

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
Another error after stopping the zkfc. Do I have to take the cluster down to format ZK? [root@rhel1 conf]# sudo service hadoop-hdfs-zkfc stop Stopping Hadoop zkfc: [ OK ] stopping zkfc [root@rhel1 conf]# sudo -u hdfs zkfc -formatZK sudo: zkfc: command not fou

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
ode > in a > > QJM / HA configuration. See you around on #hbase. If you visit Seattle, > feel > > free to give me a shout out. > > > > -- Forwarded message -- > > From: Colin Kincaid Williams > > Date: Thu, Jul 31, 2014 at 12:35 PM > >

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Colin Kincaid Williams
#x27;t have that problem here at WanDISCO. >> >> On Thu, Jul 31, 2014 at 12:46 PM, Colin Kincaid Williams >> wrote: >> > Hi this is drocsid / discord from #hbase. Thanks for the help earlier >> today. >> > Just thought I'd forward this info

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-07-31 Thread Bryan Beaudreault
gt; (ActiveStandbyElector.java:processWatchEvent(538)) - Session connected. >> Y >> 2014-07-31 18:08:00,439 INFO [main] ha.ActiveStandbyElector >> (ActiveStandbyElector.java:clearParentZNode(314)) - Recursively deleting >> /hadoop-ha/golden-apple from ZK... >> 2014-

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Colin Kincaid Williams
l:2181)] >>> zookeeper.ClientCnxn (ClientCnxn.java:onConnected(1187)) - Session >>> establishment complete on server rhel1.local/10.120.5.203:2181, >>> sessionid = 0x1478902fddc000a, negotiated timeout = 5000 >>> =

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Bryan Beaudreault
; 2014-07-31 18:07:51,802 INFO [main] zookeeper.ZooKeeper >>>> (Environment.java:logEnv(100)) - Client >>>> environment:user.dir=/etc/hbase/conf.golden_apple >>>> 2014-07-31 18:07:51,813 INFO [main] zookeeper.ZooKeeper >>>> (ZooKeeper.java:(4

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Bryan Beaudreault
(Environment.java:logEnv(100)) - Client environment:java.io.tmpdir=/tmp >>>>> 2014-07-31 18:07:51,801 INFO [main] zookeeper.ZooKeeper >>>>> (Environment.java:logEnv(100)) - Client environment:java.compiler= >>>>> 2014-07-31 18:07:51,801 INFO

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Colin Kincaid Williams
op-core-2.0.0-mr1-cdh4.1.3.jar:/usr/lib/hadoop-0.20-mapreduce/.//hadoop-2.0.0-mr1-cdh4.1.3-examples.jar:/usr/lib/hadoop-0.20-mapreduce/.//hadoop-test-2.0.0-mr1-cdh4.1.3.jar:/usr/lib/hadoop-0.20-mapreduce/.//hadoop-tools-2.0.0-mr1-cdh4.1.3.jar:/usr/lib/hadoop-0.20-mapreduce/.//hadoop-examples

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Bryan Beaudreault
loudera.2.jar:/usr/lib/hadoop-0.20-mapreduce/lib/snappy-java-1.0.4.1.jar:/usr/lib/hadoop-0.20-mapreduce/lib/junit-4.8.2.jar:/usr/lib/hadoop-0.20-mapreduce/lib/stax-api-1.0.1.jar:/usr/lib/hadoop-0.20-mapreduce/lib/commons-math-2.1.jar:/usr/lib/hadoop-0.20-mapreduce/lib/mockito-all-1.8.5.jar:/usr

Re: Juggling or swaping out the standby NameNode in a QJM / HA configuration

2014-08-01 Thread Colin Kincaid Williams
uce/lib/jersey-server-1.8.jar:/usr/lib/hadoop-0.20-mapreduce/lib/jasper-compiler-5.5.23.jar:/usr/lib/hadoop-0.20-mapreduce/lib/jackson-jaxrs-1.8.8.jar:/usr/lib/hadoop-0.20-mapreduce/lib/commons-el-1.0.jar:/usr/lib/hadoop-0.20-mapreduce/lib/commons-io-2.1.jar:/usr/lib/hadoop-0.20-mapreduce/lib/common