[ https://issues.apache.org/jira/browse/IGNITE-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17091828#comment-17091828 ]
Ivan Daschinskiy edited comment on IGNITE-10720 at 4/24/20, 7:01 PM: --------------------------------------------------------------------- [~akalashnikov] [~agoncharuk] [~zstan] +1 For deadlock. Changed implementation to synchronous and {{IgniteSequentialNodeCrashRecoveryTest}} doesn't hang locally. On master this test on my laptop hangs almost in all runs. {code} [2020-04-24 21:49:49,383][WARN ][checkpoint-runner-#215%db.IgniteSequentialNodeCrashRecoveryTest0%][IgniteTestResources] Possible failure suppressed accordingly to a configured handler [hnd=NoOpFailureHandler [super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=FailureContext [type=SYSTEM_CRITICAL_OPERATION_TIMEOUT, err=class o.a.i.IgniteException: Checkpoint read lock acquisition has been timed out.]] class org.apache.ignite.IgniteException: Checkpoint read lock acquisition has been timed out. at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.failCheckpointReadLock(GridCacheDatabaseSharedManager.java:1746) at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.checkpointReadLock(GridCacheDatabaseSharedManager.java:1683) at org.apache.ignite.internal.processors.cache.persistence.GridCacheOffheapManager$GridCacheDataStore.init0(GridCacheOffheapManager.java:1696) at org.apache.ignite.internal.processors.cache.persistence.GridCacheOffheapManager$GridCacheDataStore.fullSize(GridCacheOffheapManager.java:2069) at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$Checkpointer.lambda$fillCacheGroupState$1(GridCacheDatabaseSharedManager.java:4190) at org.apache.ignite.internal.util.IgniteUtils.lambda$wrapIgniteFuture$3(IgniteUtils.java:11412) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) {code} was (Author: ivandasch): [~akalashnikov] [~agoncharuk] [~zstan] +1 For deadlock. Changed implementation to synchronous and {{IgniteSequentialNodeCrashRecoveryTest}} doesn't hang locally. On master this test on my laptop hangs almost in all runs. > Decrease time to save metadata during checkpoint > ------------------------------------------------ > > Key: IGNITE-10720 > URL: https://issues.apache.org/jira/browse/IGNITE-10720 > Project: Ignite > Issue Type: Improvement > Reporter: Anton Kalashnikov > Assignee: Anton Kalashnikov > Priority: Major > Fix For: 2.8 > > Time Spent: 40m > Remaining Estimate: 0h > > Looks like it's not neccessery save all metadata(like free list) under write > checkpoint lock because sometimes it's too long. -- This message was sent by Atlassian Jira (v8.3.4#803005)