[
https://issues.apache.org/jira/browse/CURATOR-723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amr Elazhary updated CURATOR-723:
---------------------------------
Attachment: Status Change Logs Anonymized.csv
> [
> -
>
> Key: CURATOR-723
> URL: https://issues.apache.org/jira/browse/CURATOR-723
> Project: Apache Curator
> Issue Type: Bug
> Components: Framework
> Affects Versions: 4.3.0
> Reporter: Amr Elazhary
> Priority: Major
> Attachments: Status Change Logs Anonymized.csv
>
>
> We have a jar application
> we are using the below "curator-framework-4.3.0" and "zookeeper-3.5.7" on the
> client servers for such application
> !https://issues.apache.org/jira/s/xd97tr/820010/13pdxe5/_/images/fav-jsw.png!
> CURATOR-525 There is a race condition in Curator which might lead to fake
> SUSPENDED event and ruin CuratorFrameworkImpl inner state - ASF JIRA (State
> Bug)
> !https://issues.apache.org/jira/s/xd97tr/820010/13pdxe5/_/images/fav-jsw.png!
> CURATOR-526 Error logged for valid config - "Invalid config event received:
> \{properties}" - ASF JIRA (logging Bug)
> Above two Bugs Fixed in 5.0.0/5.2.0
>
> We have the below logs on our application (probably a combination of these
> two bugs) which ends up on consuming a lot of CPU and a failure after (due to
> open/close connections)
> {noformat}
> 2024-10-18T06:35:27.855+11:00 - o.a.c.f.state.ConnectionStateManager -
> State change: SUSPENDED
> 2024-10-18T06:35:27.855+11:00 - - New connection state : SUSPENDED
> 2024-10-18T06:35:27.855+11:00 - - Connection to server has been suspended..
> 2024-10-18T06:35:28.638+11:00 - org.apache.curator.ConnectionState -
> Negotiated session timeout: 100002024-10-18T06:35:28.639+11:00 -
> o.a.c.f.state.ConnectionStateManager - State change: RECONNECTED
> 2024-10-18T06:35:28.639+11:00 - o.a.c.f.imps.CuratorFrameworkImpl -
> Clearing sleep for 0 operations
> 2024-10-18T06:35:28.639+11:00 - - New connection state : RECONNECTED
> 2024-10-18T06:35:28.639+11:00 - - Connection to server has been
> unsuspended...2024-10-18T06:35:28.640+11:00 -
> o.a.c.framework.imps.EnsembleTracker - New config event received:
> {}2024-10-18T06:35:28.640+11:00 - - Reconnect worker starting
> 2024-10-18T06:35:28.640+11:00 - - Attempting to reconnect to the Herd
> 2024-10-18T06:35:28.640+11:00 - - Waiting for connection to server
> 2024-10-18T06:35:28.640+11:00 - - Connected to server
> 2024-10-18T06:35:28.640+11:00 - - Reconnect worker
> completed2024-10-18T06:35:28.647+11:00 - o.a.c.framework.imps.EnsembleTracker
> - Invalid config event received: {}{noformat}
>
> +*Our Questions*+
> 1.For the logging Bug "Invalid config event received" , we need to upgrade
> the Curator as per https://issues.apache.org/jira/browse/CURATOR-526 please
> confirm?
> 2. For the reconnections , we are suspecting that it is related to BUG
> https://issues.apache.org/jira/browse/CURATOR-525 , however we need to
> confirm this (how?), also what is the trigger for this BUG knowing that we
> have another jar applications having the same Curator/Zookeeper version and
> they are not facing this BUG if our assumption is correct ? please advice the
> trigger here and why it is happening on some applications and not happening
> for others. also do we have any other solution rather than upgrading the
> curator version if this is the root cause?
> 3. Can we upgrade the curator to version 5.2.0 without upgrading the Zookeper
> knowing that we use Zookeeper "zookeeper-3.5.7"?
--
This message was sent by Atlassian Jira
(v8.20.10#820010)