[ 
https://issues.apache.org/jira/browse/HDFS-4957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15628097#comment-15628097
 ] 

Aliza Itzkowitz commented on HDFS-4957:
---------------------------------------

I am facing the following scenario:
I have an HA HDFS setup where one of the JN is colocated on the same machine as 
the active NN.
When I shut down the active NN machine, I expect he standby NN to become 
active. However, the automatic failover fails with the following exception:
java.lang.IllegalArgumentException: Unable to construct journal, 
qjournal://journalnode1.service.consul:8485;journalnode2.service.consul:8485;journalnode3.service.consul:8485/hdfs

In my environment, JN1 was on the machine which was shut sown so the DNS  
journalnode1.service.consul is not resolvable and the entire failover fails.
I expect the failover to succeed since I sill have a majority of JN (2 out of 
3).


> NameNode failover should not fail because a DNS entry for a quorum node 
> cannot be resolved
> ------------------------------------------------------------------------------------------
>
>                 Key: HDFS-4957
>                 URL: https://issues.apache.org/jira/browse/HDFS-4957
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: qjm
>    Affects Versions: 2.3.0, 2.6.0
>            Reporter: Colin P. McCabe
>            Assignee: John Zhuge
>
> When a StandbyNameNode is becoming active, we should not bail out because a 
> DNS entry for a quorum node cannot be resolved.  Currently it does fail in 
> this scenario, with a message like this:
> {code}
> 2013-07-03 21:28:40,576 INFO 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Starting services 
> required for active state
> 2013-07-03 21:28:40,579 FATAL 
> org.apache.hadoop.hdfs.server.namenode.NameNode: Error encountered requiring 
> NN shutdown. Shutting down immediately.
> java.lang.IllegalArgumentException: Unable to construct journal, 
> qjournal://hadoop-mm:8485;hadoop-nn-0:8485;hadoop-nn-1:8485/hadoop
> at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.createJournal(FSEditLog.java:1254)
> at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.initJournals(FSEditLog.java:226)
> at 
> org.apache.hadoop.hdfs.server.namenode.FSEditLog.initJournalsForWrite(FSEditLog.java:193)
> at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startActiveServices(FSNamesystem.java:722)
> <etc>
> {code}
> reported by Matt Bookman



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to