Joseph Niemiec created METRON-841:
-------------------------------------

             Summary: Failed to connect node to cluster due to: java.lang.NullPointerException
                 Key: METRON-841
                 URL: https://issues.apache.org/jira/browse/METRON-841
             Project: Metron
          Issue Type: Bug
         Environment: SUSE 11 SP3, NIFI 1.1.0.2.1.2.0-10, Java jdk1.8.0_60, 2 node clustered Nifi
            Reporter: Joseph Niemiec

We had a server fill its root directory. We stopped NiFi, moved the repositories to mounts with free space, and updated the configs, and now we have a single node that will not join the cluster. There are no major errors, but we do run into some WARNs talking about an NPE before Jetty explodes.

2017-04-10 14:55:57,293 ERROR [main] o.a.nifi.controller.StandardFlowService Failed to load flow from cluster due to: org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException
org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException
    at org.apache.nifi.controller.StandardFlowService.loadFromConnectionResponse(StandardFlowService.java:901) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.StandardFlowService.load(StandardFlowService.java:493) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.web.server.JettyServer.start(JettyServer.java:831) [nifi-jetty-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.NiFi.<init>(NiFi.java:156) [nifi-runtime-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.NiFi.main(NiFi.java:262) [nifi-runtime-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
Caused by: java.lang.NullPointerException: null
    at org.apache.nifi.controller.repository.SchemaRepositoryRecordSerde.deserializeRecord(SchemaRepositoryRecordSerde.java:119) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.repository.SchemaRepositoryRecordSerde.deserializeEdit(SchemaRepositoryRecordSerde.java:109) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.repository.SchemaRepositoryRecordSerde.deserializeEdit(SchemaRepositoryRecordSerde.java:46) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.wali.MinimalLockingWriteAheadLog$Partition.recoverNextTransaction(MinimalLockingWriteAheadLog.java:1072) ~[nifi-write-ahead-log-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.wali.MinimalLockingWriteAheadLog.recoverFromEdits(MinimalLockingWriteAheadLog.java:459) ~[nifi-write-ahead-log-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.wali.MinimalLockingWriteAheadLog.recoverRecords(MinimalLockingWriteAheadLog.java:301) ~[nifi-write-ahead-log-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.repository.WriteAheadFlowFileRepository.loadFlowFiles(WriteAheadFlowFileRepository.java:346) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.FlowController.initializeFlow(FlowController.java:699) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.StandardFlowService.loadFromBytes(StandardFlowService.java:701) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.StandardFlowService.loadFromConnectionResponse(StandardFlowService.java:872) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    ... 4 common frames omitted
2017-04-10 14:55:57,293 INFO [main] o.a.n.c.c.node.NodeClusterCoordinator hpce2r01n11.abc.com:9090 requested disconnection from cluster due to org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException
2017-04-10 14:55:57,293 INFO [main] o.a.n.c.c.node.NodeClusterCoordinator Status of hpce2r01n11.abc.com:9090 changed from NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=CONNECTING, updateId=60] to NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=DISCONNECTED, Disconnect Code=Node Failed to Startup Properly, Disconnect Reason=org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException, updateId=60]
2017-04-10 14:55:57,293 DEBUG [main] o.a.n.c.c.node.NodeClusterCoordinator State of cluster nodes is now {hpce2r01n12.abc.com:9090=NodeConnectionStatus[nodeId=hpce2r01n12.abc.com:9090, state=CONNECTED, updateId=54], hpce2r01n11.abc.com:9090=NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=DISCONNECTED, Disconnect Code=Node Failed to Startup Properly, Disconnect Reason=org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException, updateId=60]}
2017-04-10 14:55:57,294 DEBUG [main] o.a.n.c.c.node.NodeClusterCoordinator Notifying cluster coordinator that node status changed from NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=CONNECTING, updateId=60] to NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=DISCONNECTED, Disconnect Code=Node Failed to Startup Properly, Disconnect Reason=org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException, updateId=60]
2017-04-10 14:55:57,295 TRACE [main] o.a.curator.utils.DefaultTracerDriver Trace: GetChildrenBuilderImpl-Foreground - 0 ms
2017-04-10 14:55:57,295 TRACE [main] o.a.curator.utils.DefaultTracerDriver Trace: GetDataBuilderImpl-Foreground - 0 ms
2017-04-10 14:55:57,404 DEBUG [Notify Cluster of Node Status Change-1] o.a.n.c.p.i.StandardClusterCoordinationProtocolSender Notified hpce2r01n12.abc.com:9090 of status change NodeStatusChangeMessage[nodeId=hpce2r01n11.abc.com:9090, status=NodeConnectionStatus[nodeId=hpce2r01n11.abc.com:9090, state=DISCONNECTED, Disconnect Code=Node Failed to Startup Properly, Disconnect Reason=org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException, updateId=60]]
2017-04-10 14:55:57,405 ERROR [main] o.a.n.c.c.node.NodeClusterCoordinator Event Reported for hpce2r01n11.abc.com:9090 -- Node disconnected from cluster due to org.apache.nifi.cluster.ConnectionException: Failed to connect node to cluster due to: java.lang.NullPointerException
2017-04-10 14:55:57,405 INFO [main] o.a.n.c.l.e.CuratorLeaderElectionManager Cannot unregister Leader Election Role 'Primary Node' becuase that role is not registered
2017-04-10 14:55:57,407 WARN [main] org.apache.nifi.web.server.JettyServer Failed to start web server... shutting down.
java.lang.IllegalStateException: Already closed or has not been started
    at com.google.common.base.Preconditions.checkState(Preconditions.java:173) ~[guava-18.0.jar:na]
    at org.apache.curator.framework.recipes.leader.LeaderSelector.close(LeaderSelector.java:270) ~[curator-recipes-2.11.0.jar:na]
    at org.apache.nifi.controller.leader.election.CuratorLeaderElectionManager.unregister(CuratorLeaderElectionManager.java:151) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.FlowController.setClustered(FlowController.java:3469) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.StandardFlowService.handleConnectionFailure(StandardFlowService.java:525) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.controller.StandardFlowService.load(StandardFlowService.java:497) ~[nifi-framework-core-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.web.server.JettyServer.start(JettyServer.java:831) ~[nifi-jetty-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.NiFi.<init>(NiFi.java:156) [nifi-runtime-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
    at org.apache.nifi.NiFi.main(NiFi.java:262) [nifi-runtime-1.1.0.2.1.2.0-10.jar:1.1.0.2.1.2.0-10]
2017-04-10 14:55:57,407 INFO [Thread-1] org.apache.nifi.NiFi Initiating shutdown of Jetty web server...
2017-04-10 14:55:57,408 DEBUG [Thread-1] o.e.j.util.component.AbstractLifeCycle stopping org.eclipse.jetty.server.Server@16736040
2017-04-10 14:55:57,408 DEBUG [Thread-1] org.eclipse.jetty.server.Server doStop org.eclipse.jetty.server.Server@16736040
2017-04-10 14:55:57,409 DEBUG [Thread-1] org.eclipse.jetty.server.Server Graceful shutdown org.eclipse.jetty.server.Server@16736040 by
2017-04-10 14:55:57,410 DEBUG [Thread-1] o.e.j.util.component.AbstractLifeCycle stopping ServerConnector@546394ed{SSL,[ssl, http/1.1]}{hpce2r01n11.abc.com:9090}

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)