Can you show us the configuration files?
Maybe I can help you with some suggestions.
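
If pasting the whole files is awkward, a rough sketch like the one below could pull out just the HA-related settings from hdfs-site.xml and core-site.xml. The property prefixes are the ones I would expect from the HA guide; the function name and the idea of passing file paths on the command line are only illustrative assumptions, so adjust them to your layout.

```python
import sys
import xml.etree.ElementTree as ET

# Property name prefixes that usually matter for an HA setup.
# This list is an assumption about what is relevant; extend it as needed.
HA_PREFIXES = (
    "fs.defaultFS",
    "ha.zookeeper.quorum",
    "dfs.nameservices",
    "dfs.ha.",
    "dfs.namenode.rpc-address",
    "dfs.namenode.http-address",
    "dfs.namenode.shared.edits.dir",
    "dfs.journalnode.edits.dir",
    "dfs.client.failover.proxy.provider",
)


def ha_properties(root):
    """Return {name: value} for HA-related <property> entries under root."""
    found = {}
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        value = (prop.findtext("value") or "").strip()
        # str.startswith accepts a tuple of prefixes.
        if name.startswith(HA_PREFIXES):
            found[name] = value
    return found


if __name__ == "__main__":
    # Paths are supplied by the caller, e.g. the hdfs-site.xml and
    # core-site.xml of each NameNode and JournalNode host.
    for path in sys.argv[1:]:
        print(f"# {path}")
        props = ha_properties(ET.parse(path).getroot())
        for name, value in sorted(props.items()):
            print(f"{name} = {value}")
```

Running it against the config files on each node and comparing the output side by side would make mismatches between hosts easier to spot.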


On Tue, Oct 3, 2023 at 9:05 AM Harry Jamison
<harryjamiso...@yahoo.com.invalid> wrote:

> I am trying to set up an HA HDFS cluster, and I am running into a problem.
>
> I am not sure what I am doing wrong. I thought I followed the HA namenode
> guide, but it is not working.
>
>
> Apache Hadoop 3.3.6 – HDFS High Availability
> <https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithNFS.html>
>
>
>
> I have 2 namenodes and 3 journal nodes, and 3 zookeeper nodes.
>
> After some period of time I see the following and my namenode and journal
> node die.
> I am not sure where the problem is, or how to diagnose what I am doing
> wrong here.  And the logging here does not make sense to me.
>
> Namenode
> Serving checkpoints at http://nn1:9870
> (org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer)
>
> real-time non-blocking time  (microseconds, -R) unlimited
> core file size              (blocks, -c) 0
> data seg size               (kbytes, -d) unlimited
> scheduling priority                 (-e) 0
> file size                   (blocks, -f) unlimited
> pending signals                     (-i) 15187
> max locked memory           (kbytes, -l) 8192
> max memory size             (kbytes, -m) unlimited
> open files                          (-n) 1024
> pipe size                (512 bytes, -p) 8
> POSIX message queues         (bytes, -q) 819200
> real-time priority                  (-r) 0
> stack size                  (kbytes, -s) 8192
> cpu time                   (seconds, -t) unlimited
> max user processes                  (-u) 15187
> virtual memory              (kbytes, -v) unlimited
> file locks                          (-x) unlimited
>
> [2023-10-02 23:53:46,693] ERROR RECEIVED SIGNAL 15: SIGTERM
> (org.apache.hadoop.hdfs.server.namenode.NameNode)
>
> [2023-10-02 23:53:46,701] INFO SHUTDOWN_MSG:
> /************************************************************
> SHUTDOWN_MSG: Shutting down NameNode at nn1/192.168.1.159
> ************************************************************/
> (org.apache.hadoop.hdfs.server.namenode.NameNode)
>
> JournalNode
> [2023-10-02 23:54:19,162] WARN Journal at nn1/192.168.1.159:8485 has no
> edit logs (org.apache.hadoop.hdfs.qjournal.server.JournalNodeSyncer)
>
> real-time non-blocking time  (microseconds, -R) unlimited
> core file size              (blocks, -c) 0
> data seg size               (kbytes, -d) unlimited
> scheduling priority                 (-e) 0
> file size                   (blocks, -f) unlimited
> pending signals                     (-i) 15187
> max locked memory           (kbytes, -l) 8192
> max memory size             (kbytes, -m) unlimited
> open files                          (-n) 1024
> pipe size                (512 bytes, -p) 8
> POSIX message queues         (bytes, -q) 819200
> real-time priority                  (-r) 0
> stack size                  (kbytes, -s) 8192
> cpu time                   (seconds, -t) unlimited
> max user processes                  (-u) 15187
> virtual memory              (kbytes, -v) unlimited
> file locks                          (-x) unlimited
>
>
>

-- 
*Best*

Liming
