[ https://issues.apache.org/jira/browse/NIFI-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061105#comment-16061105 ]
ASF GitHub Bot commented on NIFI-4073: -------------------------------------- Github user alopresto commented on the issue: https://github.com/apache/nifi/pull/1916 Ran `contrib-check` and all tests pass. +1, merging. I did start a two node cluster and ensure it was communicating, then used `pf` to temporarily block the cluster port to trigger the error message. I did not see the duplicated stack trace. ``` # Enabled port blocking 2017-06-23 11:12:55,991 INFO [Clustering Tasks Thread-1] o.a.n.c.c.ClusterProtocolHeartbeater Heartbeat created at 2017-06-23 11:12:55,987 and sent to localhost:10000 at 2017-06-23 11:12:55,991; send took 3 millis 2017-06-23 11:12:56,561 INFO [Process Cluster Protocol Request-10] o.a.n.c.p.impl.SocketProtocolListener Finished processing request 3ecf8509-0b0b-4602-ac55-1ec9057dc3c3 (type=HEARTBEAT, length=2344 bytes) from localhost:8081 in 1 millis 2017-06-23 11:12:57,293 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 6930 nanos 2017-06-23 11:13:00,433 INFO [Write-Ahead Local State Provider Maintenance] org.wali.MinimalLockingWriteAheadLog org.wali.MinimalLockingWriteAheadLog@7a80ebcd checkpointed with 1 Records and 0 Swap Files in 12 milliseconds (Stop-the-world time = 4 milliseconds, Clear Edit Logs time = 4 millis), max Transaction ID 0 2017-06-23 11:13:00,998 INFO [Process Cluster Protocol Request-1] o.a.n.c.p.impl.SocketProtocolListener Finished processing request dc71898f-f3e6-4aaa-af2e-b0dae65e4f53 (type=HEARTBEAT, length=2344 bytes) from localhost:8080 in 1 millis 2017-06-23 11:13:00,999 INFO [Clustering Tasks Thread-1] o.a.n.c.c.ClusterProtocolHeartbeater Heartbeat created at 2017-06-23 11:13:00,995 and sent to localhost:10000 at 2017-06-23 11:13:00,999; send took 4 millis 2017-06-23 11:13:01,572 INFO [Process Cluster Protocol Request-2] o.a.n.c.p.impl.SocketProtocolListener Finished processing request 83d6969d-6073-43d4-b8e3-c182d4feea48 (type=HEARTBEAT, length=2344 bytes) from localhost:8081 in 2 millis 2017-06-23 11:13:02,296 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 6734 nanos 2017-06-23 11:13:07,300 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 5332 nanos 2017-06-23 11:13:12,305 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 6497 nanos 2017-06-23 11:13:17,308 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 5851 nanos 2017-06-23 11:13:22,314 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 5554 nanos 2017-06-23 11:13:27,318 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 5786 nanos 2017-06-23 11:13:32,045 WARN [Clustering Tasks Thread-1] o.apache.nifi.controller.FlowController Failed to send heartbeat due to: org.apache.nifi.cluster.protocol.ProtocolException: Failed to send message to Cluster Coordinator due to: java.net.ConnectException: Operation timed out 2017-06-23 11:13:32,323 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 6357 nanos 2017-06-23 11:13:37,328 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 6451 nanos 2017-06-23 11:13:42,333 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 2 heartbeats in 5380 nanos 2017-06-23 11:13:42,333 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator localhost:8081 requested disconnection from cluster due to Have not received a heartbeat from node in 40 seconds 2017-06-23 11:13:42,333 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Status of localhost:8081 changed from NodeConnectionStatus[nodeId=localhost:8081, state=CONNECTED, updateId=125] to NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126] 2017-06-23 11:13:42,334 WARN [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8081 -- Node disconnected from cluster due to Have not received a heartbeat from node in 40 seconds 2017-06-23 11:13:42,334 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator localhost:8080 requested disconnection from cluster due to Have not received a heartbeat from node in 41 seconds 2017-06-23 11:13:42,334 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Status of localhost:8080 changed from NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTED, updateId=121] to NodeConnectionStatus[nodeId=localhost:8080, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 41 seconds, updateId=127] 2017-06-23 11:13:42,335 WARN [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8080 -- Node disconnected from cluster due to Have not received a heartbeat from node in 41 seconds 2017-06-23 11:13:54,349 INFO [NiFi Web Server-183] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,349 INFO [NiFi Web Server-141] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,349 INFO [NiFi Web Server-158] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,352 INFO [NiFi Web Server-211] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:14:03,063 WARN [Clustering Tasks Thread-1] o.apache.nifi.controller.FlowController Failed to send heartbeat due to: org.apache.nifi.cluster.protocol.ProtocolException: Failed to send message to Cluster Coordinator due to: java.net.ConnectException: Operation timed out # Disabled port blocking 2017-06-23 11:13:42,335 WARN [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8080 -- Node disconnected from cluster due to Have not received a heartbeat from node in 41 seconds 2017-06-23 11:13:54,349 INFO [NiFi Web Server-183] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,349 INFO [NiFi Web Server-141] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,349 INFO [NiFi Web Server-158] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:13:54,352 INFO [NiFi Web Server-211] o.a.n.c.m.e.NoConnectedNodesException Cluster failed processing request: org.apache.nifi.cluster.manager.exception.NoConnectedNodesException. Returning Internal Server Error response. 2017-06-23 11:14:03,063 WARN [Clustering Tasks Thread-1] o.apache.nifi.controller.FlowController Failed to send heartbeat due to: org.apache.nifi.cluster.protocol.ProtocolException: Failed to send message to Cluster Coordinator due to: java.net.ConnectException: Operation timed out 2017-06-23 11:14:05,343 INFO [pool-12-thread-1] o.a.n.c.r.WriteAheadFlowFileRepository Initiating checkpoint of FlowFile Repository 2017-06-23 11:14:05,523 INFO [pool-12-thread-1] org.wali.MinimalLockingWriteAheadLog org.wali.MinimalLockingWriteAheadLog@3f4b59fd checkpointed with 0 Records and 0 Swap Files in 180 milliseconds (Stop-the-world time = 70 milliseconds, Clear Edit Logs time = 93 millis), max Transaction ID -1 2017-06-23 11:14:05,523 INFO [pool-12-thread-1] o.a.n.c.r.WriteAheadFlowFileRepository Successfully checkpointed FlowFile Repository with 0 records in 180 milliseconds 2017-06-23 11:14:34,103 WARN [Clustering Tasks Thread-1] o.apache.nifi.controller.FlowController Failed to send heartbeat due to: org.apache.nifi.cluster.protocol.ProtocolException: Failed to send message to Cluster Coordinator due to: java.net.ConnectException: Operation timed out 2017-06-23 11:14:52,374 INFO [Process Cluster Protocol Request-3] o.a.n.c.p.impl.SocketProtocolListener Finished processing request e636ce6d-32b2-43f1-b4f5-57fcf9a31f25 (type=HEARTBEAT, length=2724 bytes) from localhost:8080 in 1 millis 2017-06-23 11:14:52,376 INFO [Clustering Tasks Thread-1] o.a.n.c.c.ClusterProtocolHeartbeater Heartbeat created at 2017-06-23 11:14:39,108 and sent to localhost:10000 at 2017-06-23 11:14:52,376; send took 13268 millis 2017-06-23 11:14:52,388 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8080 -- Received heartbeat from node previously disconnected due to Lack of Heartbeat. Issuing reconnection request. 2017-06-23 11:14:52,388 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8080 -- Requesting that node connect to cluster 2017-06-23 11:14:52,388 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Status of localhost:8080 changed from NodeConnectionStatus[nodeId=localhost:8080, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 41 seconds, updateId=127] to NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTING, updateId=128] 2017-06-23 11:14:52,390 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.h.AbstractHeartbeatMonitor Finished processing 1 heartbeats in 1 millis 2017-06-23 11:14:52,395 INFO [Process Cluster Protocol Request-4] o.a.n.c.p.impl.SocketProtocolListener Finished processing request c9b89497-1cf4-452e-a7e3-92975222f0b7 (type=RECONNECTION_REQUEST, length=2217 bytes) from localhost:8080 in 2 millis 2017-06-23 11:14:52,395 INFO [Reconnect to Cluster] o.a.nifi.controller.StandardFlowService Processing reconnection request from manager. 2017-06-23 11:14:52,395 INFO [Reconnect to Cluster] o.a.n.c.c.node.NodeClusterCoordinator Resetting cluster node statuses from {localhost:8081=NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126], localhost:8080=NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTING, updateId=128]} to {localhost:8081=NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126], localhost:8080=NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTING, updateId=128]} 2017-06-23 11:14:52,396 INFO [Reconnect localhost:8080] o.a.n.c.c.node.NodeClusterCoordinator Successfully requested that localhost:8080 join the cluster 2017-06-23 11:14:52,404 INFO [Reconnect to Cluster] o.apache.nifi.controller.FlowController Successfully synchronized controller with proposed flow 2017-06-23 11:14:52,413 INFO [Reconnect to Cluster] o.a.nifi.controller.StandardFlowService Setting Flow Controller's Node ID: localhost:8080 2017-06-23 11:14:52,413 INFO [Reconnect to Cluster] o.a.n.c.c.node.NodeClusterCoordinator This node is now connected to the cluster. Will no longer require election of DataFlow. 2017-06-23 11:14:52,413 INFO [Reconnect to Cluster] o.apache.nifi.controller.FlowController Starting 0 processors/ports/funnels 2017-06-23 11:14:52,414 INFO [Reconnect to Cluster] o.apache.nifi.controller.FlowController Started 0 Remote Group Ports transmitting 2017-06-23 11:14:52,414 INFO [Reconnect to Cluster] o.a.n.c.c.node.NodeClusterCoordinator Resetting cluster node statuses from {localhost:8081=NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126], localhost:8080=NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTING, updateId=128]} to {localhost:8081=NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126], localhost:8080=NodeConnectionStatus[nodeId=localhost:8080, state=CONNECTING, updateId=128]} 2017-06-23 11:14:52,416 INFO [Reconnect to Cluster] o.a.nifi.controller.StandardFlowService Node reconnected. 2017-06-23 11:14:52,977 INFO [Process Cluster Protocol Request-5] o.a.n.c.p.impl.SocketProtocolListener Finished processing request c42f9b35-953d-47dc-a2f9-952307c60461 (type=HEARTBEAT, length=2344 bytes) from localhost:8081 in 1 millis 2017-06-23 11:14:57,396 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8081 -- Received heartbeat from node previously disconnected due to Lack of Heartbeat. Issuing reconnection request. 2017-06-23 11:14:57,396 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for localhost:8081 -- Requesting that node connect to cluster 2017-06-23 11:14:57,396 INFO [Heartbeat Monitor Thread-1] o.a.n.c.c.node.NodeClusterCoordinator Status of localhost:8081 changed from NodeConnectionStatus[nodeId=localhost:8081, state=DISCONNECTED, Disconnect Code=Lack of Heartbeat, Disconnect Reason=Have not received a heartbeat from node in 40 seconds, updateId=126] to NodeConnectionStatus[nodeId=localhost:8081, state=CONNECTING, updateId=130] ``` > Duplicated stack trace in ThreadPoolRequestReplicator > ----------------------------------------------------- > > Key: NIFI-4073 > URL: https://issues.apache.org/jira/browse/NIFI-4073 > Project: Apache NiFi > Issue Type: Bug > Components: Core Framework > Affects Versions: 1.3.0 > Reporter: Pierre Villard > Assignee: Pierre Villard > Priority: Trivial > > There is a duplicated stack trace due to the following part. > {code:title=ThreadPoolRequestReplicator.java|borderStyle=solid} > try { > // create and send the request > final WebResource.Builder resourceBuilder = createResourceBuilder(); > final String requestId = headers.get("x-nifi-request-id"); > logger.debug("Replicating request {} {} to {}", method, uri.getPath(), > nodeId); > nodeResponse = replicateRequest(resourceBuilder, nodeId, method, uri, > requestId, headers, clusterResponse); > } catch (final Exception e) { > nodeResponse = new NodeResponse(nodeId, method, uri, e); > logger.warn("Failed to replicate request {} {} to {} due to {}", method, > uri.getPath(), nodeId, e); > logger.warn("", e); > } > {code} > It just requires to use {{e.toString()}}. -- This message was sent by Atlassian JIRA (v6.4.14#64029)