[ https://issues.apache.org/jira/browse/HDFS-11192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15710722#comment-15710722 ]
Kihwal Lee commented on HDFS-11192: ----------------------------------- Was it hitting its ulimit on # of threads/processes? If that is the case, other bad things can happen even if this stage doesn't cause any trouble. E.g. replication queue init will be done by starting up a separate thread. If that doesn't happen, the user will be in a bigger trouble. A potential improvement will be to terminate NN if this happens. > OOM during Quota Initialization lead to Namenode hang > ----------------------------------------------------- > > Key: HDFS-11192 > URL: https://issues.apache.org/jira/browse/HDFS-11192 > Project: Hadoop HDFS > Issue Type: Bug > Reporter: Brahma Reddy Battula > Assignee: Brahma Reddy Battula > Attachments: namenodeThreadDump.out > > > AFAIK ,In RecurisveTask Execution, When ForkjoinThreadpool's thread dies or > not able to create,it will not notify the parent.Parent still waiting for the > notify call..that's not timed waiting also. > *Trace from Namenode log* > {noformat} > Exception in thread "ForkJoinPool-1-worker-2" Exception in thread > "ForkJoinPool-1-worker-3" java.lang.OutOfMemoryError: unable to create new > native thread > at java.lang.Thread.start0(Native Method) > at java.lang.Thread.start(Thread.java:714) > at > java.util.concurrent.ForkJoinPool.createWorker(ForkJoinPool.java:1486) > at > java.util.concurrent.ForkJoinPool.tryAddWorker(ForkJoinPool.java:1517) > at > java.util.concurrent.ForkJoinPool.deregisterWorker(ForkJoinPool.java:1609) > at > java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:167) > java.lang.OutOfMemoryError: unable to create new native thread > at java.lang.Thread.start0(Native Method) > at java.lang.Thread.start(Thread.java:714) > at > java.util.concurrent.ForkJoinPool.createWorker(ForkJoinPool.java:1486) > at > java.util.concurrent.ForkJoinPool.tryAddWorker(ForkJoinPool.java:1517) > at > java.util.concurrent.ForkJoinPool.deregisterWorker(ForkJoinPool.java:1609) > at > java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:167) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org