[ https://issues.apache.org/jira/browse/IGNITE-21523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Roman Puchkovskiy updated IGNITE-21523: --------------------------------------- Description: After running the test repeatedly, the following happened: after a restart of 2 clusterServices, one of the restarted cluster services (port 3344) and one of not restarted cluster services (port 3345) could not 'see' each other due to metadata obtaining timeout: [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MetadataStore] [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][56ad9288-7ce4-4f67-b11c-8e1eaf6d6612] Timeout getting GetMetadataResp from 127.0.1.1:3344 within 1000 ms, cause: java.util.concurrent.TimeoutException: Did not observe any item or terminal signal within 1000ms in 'source(MonoDefer)' (and no fallback has been configured) [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MembershipProtocol] [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][updateMembership][SYNC] Skipping to add/update member: \{m: default:inrt_tr_3344:788fea0a8f9b4251@127.0.1.1:3344, s: ALIVE, inc: 0}, due to failed fetchMetadata call (cause: java.util.concurrent.TimeoutException: Did not observe any item or terminal signal within 1000ms in 'source(MonoDefer)' (and no fallback has been configured)) was: After running the test repeatedly, the following happened: after a restart of 2 clusterServices, one of the restarted cluster services (port 3344) and one of not restarted cluster services (port 3345) could not 'see' each other due to metadata obtaining timeout: [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MetadataStore] [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][56ad9288-7ce4-4f67-b11c-8e1eaf6d6612] Timeout getting GetMetadataResp from 127.0.1.1:3344 within 1000 ms, cause: java.util.concurrent.TimeoutException: Did not observe any item or terminal signal within 1000ms in 'source(MonoDefer)' (and no fallback has been configured) [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MembershipProtocol] [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][updateMembership][SYNC] Skipping to add/update member: \{m: default:inrt_tr_3344:788fea0a8f9b4251@127.0.1.1:3344, s: ALIVE, inc: 0}, due to failed fetchMetadata call (cause: java.util.concurrent.TimeoutException: Did not observe any item or terminal signal within 1000ms in 'source(MonoDefer)' (and no fallback has been configured)) > ItNodeRestartsTest.testRestarts is flaky > ---------------------------------------- > > Key: IGNITE-21523 > URL: https://issues.apache.org/jira/browse/IGNITE-21523 > Project: Ignite > Issue Type: Bug > Reporter: Roman Puchkovskiy > Assignee: Roman Puchkovskiy > Priority: Major > Labels: ignite-3 > Fix For: 3.0.0-beta2 > > > After running the test repeatedly, the following happened: after a restart of > 2 clusterServices, one of the restarted cluster services (port 3344) and one > of not restarted cluster services (port 3345) could not 'see' each other due > to metadata obtaining timeout: > [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MetadataStore] > [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][56ad9288-7ce4-4f67-b11c-8e1eaf6d6612] > Timeout getting GetMetadataResp from 127.0.1.1:3344 within 1000 ms, cause: > java.util.concurrent.TimeoutException: Did not observe any item or terminal > signal within 1000ms in 'source(MonoDefer)' (and no fallback has been > configured) > [2024-02-12T14:16:54,968][WARN ][sc-cluster-3345-1409][MembershipProtocol] > [default:inrt_tr_3345:13b7bdae21a14c86@127.0.1.1:3345][updateMembership][SYNC] > Skipping to add/update member: \{m: > default:inrt_tr_3344:788fea0a8f9b4251@127.0.1.1:3344, s: ALIVE, inc: 0}, due > to failed fetchMetadata call (cause: java.util.concurrent.TimeoutException: > Did not observe any item or terminal signal within 1000ms in > 'source(MonoDefer)' (and no fallback has been configured)) > -- This message was sent by Atlassian Jira (v8.20.10#820010)