[ 
https://issues.apache.org/jira/browse/NIFI-6763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948064#comment-16948064
 ] 

Shayne Burgess commented on NIFI-6763:
--------------------------------------

I have a large NIFI cluster (300 nodes) with no data flow running on it and I 
am seeing UI stability issues. 

The log for the coordinator node is running out of file handles (we have set 
the limit to 50000)
{code:java}
2019-10-09 22:51:17,193 WARN [Cluster Socket Listener] 
org.apache.nifi.io.socket.SocketListener Failed to communicate with Unknown 
Host due to java.net.SocketException: Too many open files (Accept failed) 
java.net.SocketException: Too many open files (Accept failed)     at 
java.net.PlainSocketImpl.socketAccept(Native Method)     at 
java.net.AbstractPlainSocketImpl.accept(AbstractPlainSocketImpl.java:409)     
at java.net.ServerSocket.implAccept(ServerSocket.java:545)     at 
java.net.ServerSocket.accept(ServerSocket.java:513)     at 
org.apache.nifi.io.socket.SocketListener$2.run(SocketListener.java:112)     at 
java.lang.Thread.run(Thread.java:748)
{code}
All nodes are regularly showing timeout 
{code:java}
2019-10-09 22:55:14,852 WARN [Replicate Request Thread-1] 
o.a.n.c.c.h.r.ThreadPoolRequestReplicator Failed to replicate request GET 
/nifi-api/system-diagnostics to cj-vmss-300nodes-nifi00004O:8080 due to 
java.net.SocketTimeoutException: timeout
2019-10-09 22:55:14,852 WARN [Replicate Request Thread-1] 
o.a.n.c.c.h.r.ThreadPoolRequestReplicator
java.net.SocketTimeoutException: timeout
    at okio.Okio$4.newTimeoutException(Okio.java:232)
    at okio.AsyncTimeout.exit(AsyncTimeout.java:285)
    at okio.AsyncTimeout$2.read(AsyncTimeout.java:241)
    at okio.RealBufferedSource.indexOf(RealBufferedSource.java:355)
    at okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.java:227)
    at okhttp3.internal.http1.Http1Codec.readHeaderLine(Http1Codec.java:215)
    at 
okhttp3.internal.http1.Http1Codec.readResponseHeaders(Http1Codec.java:189)
    at 
okhttp3.internal.http.CallServerInterceptor.intercept(CallServerInterceptor.java:88)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.connection.ConnectInterceptor.intercept(ConnectInterceptor.java:45)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at 
okhttp3.internal.cache.CacheInterceptor.intercept(CacheInterceptor.java:93)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at 
okhttp3.internal.http.BridgeInterceptor.intercept(BridgeInterceptor.java:93)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RetryAndFollowUpInterceptor.intercept(RetryAndFollowUpInterceptor.java:126)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at okhttp3.RealCall.getResponseWithInterceptorChain(RealCall.java:200)
    at okhttp3.RealCall.execute(RealCall.java:77)
    at 
org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:138)
    at 
org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:132)
    at 
org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator.replicateRequest(ThreadPoolRequestReplicator.java:647)
    at 
org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator$NodeHttpRequest.run(ThreadPoolRequestReplicator.java:839)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Caused by: java.net.SocketException: Socket closed
    at java.net.SocketInputStream.read(SocketInputStream.java:204)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at okio.Okio$2.read(Okio.java:140)
    at okio.AsyncTimeout$2.read(AsyncTimeout.java:237)
    ... 28 common frames omitted
{code}
When the UI times out, out the following exception is thrown
{code:java}
2019-10-09 22:57:18,631 WARN [Replicate Request Thread-1] 
o.a.n.c.c.h.r.ThreadPoolRequestReplicator Failed to replicate request GET 
/nifi-api/flow/current-user to cj-vmss-300nodes-nifi00004O:8080 due to 
java.net.SocketTimeoutException: timeout
2019-10-09 22:57:18,631 WARN [Replicate Request Thread-1] 
o.a.n.c.c.h.r.ThreadPoolRequestReplicator
java.net.SocketTimeoutException: timeout
    at okio.Okio$4.newTimeoutException(Okio.java:232)
    at okio.AsyncTimeout.exit(AsyncTimeout.java:285)
    at okio.AsyncTimeout$2.read(AsyncTimeout.java:241)
    at okio.RealBufferedSource.indexOf(RealBufferedSource.java:355)
    at okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.java:227)
    at okhttp3.internal.http1.Http1Codec.readHeaderLine(Http1Codec.java:215)
    at 
okhttp3.internal.http1.Http1Codec.readResponseHeaders(Http1Codec.java:189)
    at 
okhttp3.internal.http.CallServerInterceptor.intercept(CallServerInterceptor.java:88)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.connection.ConnectInterceptor.intercept(ConnectInterceptor.java:45)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at 
okhttp3.internal.cache.CacheInterceptor.intercept(CacheInterceptor.java:93)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at 
okhttp3.internal.http.BridgeInterceptor.intercept(BridgeInterceptor.java:93)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RetryAndFollowUpInterceptor.intercept(RetryAndFollowUpInterceptor.java:126)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:147)
    at 
okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.java:121)
    at okhttp3.RealCall.getResponseWithInterceptorChain(RealCall.java:200)
    at okhttp3.RealCall.execute(RealCall.java:77)
    at 
org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:138)
    at 
org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:132)
    at 
org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator.replicateRequest(ThreadPoolRequestReplicator.java:647)
    at 
org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator$NodeHttpRequest.run(ThreadPoolRequestReplicator.java:839)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Caused by: java.net.SocketException: Socket closed
    at java.net.SocketInputStream.read(SocketInputStream.java:204)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at okio.Okio$2.read(Okio.java:140)
    at okio.AsyncTimeout$2.read(AsyncTimeout.java:237)
    ... 28 common frames omitted
{code}

> UI Timeout on large NIFI cluster
> --------------------------------
>
>                 Key: NIFI-6763
>                 URL: https://issues.apache.org/jira/browse/NIFI-6763
>             Project: Apache NiFi
>          Issue Type: Bug
>            Reporter: Shayne Burgess
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to