[
https://issues.apache.org/jira/browse/IGNITE-25142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17944758#comment-17944758
]
Roman Puchkovskiy commented on IGNITE-25142:
--------------------------------------------
There is a deadlock between PartitionReplicaLifecycleManager and TableManager
which manifests itself on node start.
# PRLM starts a zone X; by doing this, it takes a write lock on the zone
# TableManager attemtps to start tables of node X, and for doing so it has to
take a read lock on zone X, so it is blocked by PRLM
# But replica starts cannot be initiated until all tables are started, so
TableManager holds PRLM, resulting in a deadlock
The deadlock can be eliminated if we don't acquire zone write locks on node
start in PRLM. These locks are not needed at node start as another mutual
exclusion mechanism (namely that 'first start tables, only then start
replicas') is in place.
> ItZoneDataReplicationTest.testLocalRaftLogReapplication is flaky
> ----------------------------------------------------------------
>
> Key: IGNITE-25142
> URL: https://issues.apache.org/jira/browse/IGNITE-25142
> Project: Ignite
> Issue Type: Bug
> Reporter: Roman Puchkovskiy
> Assignee: Roman Puchkovskiy
> Priority: Major
> Labels: ignite-3
> Time Spent: 10m
> Remaining Estimate: 0h
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)