Can you show us the configuration files? Maybe I can help you with some suggestions.
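If the full files are too long to paste, a small script along these lines could pull out just the HA-related properties from hdfs-site.xml and core-site.xml. This is only a rough sketch: the prefix list is my own guess at what matters for an HA setup, not an official or complete list, and `ha_properties` is a hypothetical helper name.

```python
import sys
import xml.etree.ElementTree as ET

# Property-name prefixes that usually matter for HDFS HA.
# Assumption: this selection is illustrative, not exhaustive.
HA_PREFIXES = (
    "fs.defaultFS",
    "ha.zookeeper.quorum",
    "dfs.nameservices",
    "dfs.ha.",
    "dfs.namenode.rpc-address",
    "dfs.namenode.http-address",
    "dfs.namenode.shared.edits.dir",
    "dfs.journalnode.edits.dir",
    "dfs.client.failover.proxy.provider",
)


def ha_properties(xml_data):
    """Return {name: value} for HA-related <property> entries.

    xml_data is the content of a Hadoop *-site.xml file (str or bytes).
    """
    root = ET.fromstring(xml_data)
    found = {}
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        value = (prop.findtext("value") or "").strip()
        # str.startswith accepts a tuple and matches any of the prefixes.
        if name.startswith(HA_PREFIXES):
            found[name] = value
    return found


if __name__ == "__main__":
    # Paths are passed on the command line; none are assumed here.
    for path in sys.argv[1:]:
        with open(path, "rb") as f:
            props = ha_properties(f.read())
        for name, value in sorted(props.items()):
            print(f"{path}: {name} = {value}")
```

Run it with the paths to your two site files as arguments and paste the output here, from both namenodes if the files differ.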

On Tue, Oct 3, 2023 at 9:05 AM Harry Jamison <harryjamiso...@yahoo.com.invalid> wrote:

> I am trying to setup a HA HDFS cluster, and I am running into a problem
>
> I am not sure what I am doing wrong, I thought I followed the HA namenode
> guide, but it is not working.
>
> Apache Hadoop 3.3.6 – HDFS High Availability
> <https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithNFS.html>
>
> I have 2 namenodes and 3 journal nodes, and 3 zookeeper nodes.
>
> After some period of time I see the following and my namenode and journal
> node die.
> I am not sure where the problem is, or how to diagnose what I am doing
> wrong here. And the logging here does not make sense to me.
>
> Namenode
> Serving checkpoints at http://nn1:9870
> (org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer)
>
> real-time non-blocking time (microseconds, -R) unlimited
> core file size (blocks, -c) 0
> data seg size (kbytes, -d) unlimited
> scheduling priority (-e) 0
> file size (blocks, -f) unlimited
> pending signals (-i) 15187
> max locked memory (kbytes, -l) 8192
> max memory size (kbytes, -m) unlimited
> open files (-n) 1024
> pipe size (512 bytes, -p) 8
> POSIX message queues (bytes, -q) 819200
> real-time priority (-r) 0
> stack size (kbytes, -s) 8192
> cpu time (seconds, -t) unlimited
> max user processes (-u) 15187
> virtual memory (kbytes, -v) unlimited
> file locks (-x) unlimited
>
> [2023-10-02 23:53:46,693] ERROR RECEIVED SIGNAL 15: SIGTERM
> (org.apache.hadoop.hdfs.server.namenode.NameNode)
>
> [2023-10-02 23:53:46,701] INFO SHUTDOWN_MSG:
> /************************************************************
> SHUTDOWN_MSG: Shutting down NameNode at nn1/192.168.1.159
> ************************************************************/
> (org.apache.hadoop.hdfs.server.namenode.NameNode)
>
> JournalNode
> [2023-10-02 23:54:19,162] WARN Journal at nn1/192.168.1.159:8485 has no
> edit logs
> (org.apache.hadoop.hdfs.qjournal.server.JournalNodeSyncer)
>
> real-time non-blocking time (microseconds, -R) unlimited
> core file size (blocks, -c) 0
> data seg size (kbytes, -d) unlimited
> scheduling priority (-e) 0
> file size (blocks, -f) unlimited
> pending signals (-i) 15187
> max locked memory (kbytes, -l) 8192
> max memory size (kbytes, -m) unlimited
> open files (-n) 1024
> pipe size (512 bytes, -p) 8
> POSIX message queues (bytes, -q) 819200
> real-time priority (-r) 0
> stack size (kbytes, -s) 8192
> cpu time (seconds, -t) unlimited
> max user processes (-u) 15187
> virtual memory (kbytes, -v) unlimited
> file locks (-x) unlimited

--
*Best*

Liming