[ 
https://issues.apache.org/jira/browse/NIFI-9559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17483171#comment-17483171
 ] 

Shawn Weeks edited comment on NIFI-9559 at 1/27/22, 2:20 PM:
-------------------------------------------------------------

Just wanted to add some details, first no I'm not using TLS between NiFi and 
Zookeeper nor any other authentication for Zookeeper.  Second this morning 
after one of my nodes went down I went and reviewed the firewall logs and I 
noticed that when the issue occurs NiFi no longer even tries to talk to 
Zookeeper. After it starts throwing the error it just never connects. I still 
haven't had any luck forcing the issue to occur. I think you'd have to silently 
terminate the connection between Zookeeper and NiFi. Just virtually pulling the 
network plug on a NiFi VM doesn't seem to cause this.

[~thenatog]


was (Author: absolutesantaja):
Just wanted to add some details, first no I'm not using TLS between NiFi and 
Zookeeper nor any other authentication for Zookeeper.  Second this morning 
after one of my nodes went down I went and reviewed the firewall logs and I 
noticed that when the issue occurs NiFi no longer even tries to talk to 
Zookeeper. After it starts throwing the error it just never connects. I still 
haven't had any luck forcing the issue to occur. I think you'd have to silently 
terminate the connection between Zookeeper and NiFi. Just virtually pulling the 
network plug on a NiFi VM doesn't seem to cause this.

> Zookeeper Client Can't Reconnect - Session timeout has elapsed while SUSPENDED
> ------------------------------------------------------------------------------
>
>                 Key: NIFI-9559
>                 URL: https://issues.apache.org/jira/browse/NIFI-9559
>             Project: Apache NiFi
>          Issue Type: Bug
>            Reporter: Shawn Weeks
>            Assignee: Nathan Gough
>            Priority: Minor
>         Attachments: nifi_and_zookeeper_logs.txt
>
>
> It's possible this is fixed in 1.15.2 but I don't see any commits that would 
> have resolved it. After a loss of connection to Zookeeper a NiFi node never 
> successfully reconnects to the Zookeeper or the Cluster and instead returns 
> errors about no Cluster Coordinator and a Session timeout has elapsed while 
> SUSPENDED repeatedly until you restart NiFi.
> The error described is the same one at 
> https://issues.apache.org/jira/browse/CURATOR-405 however that patch has been 
> in NiFi for several versions now.
> NiFi version is 1.14.0 and Zookeeper 3.6.3



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to