Matthias Pohl created FLINK-33574: ------------------------------------- Summary: testRecoverAfterMultiplePersistsStateWithMultiPart andtestRecoverAfterMultiplePersistsStateWithMultiPart run into timeouts Key: FLINK-33574 URL: https://issues.apache.org/jira/browse/FLINK-33574 Project: Flink Issue Type: Bug Components: Connectors / FileSystem Affects Versions: 1.19.0 Reporter: Matthias Pohl
[https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=54446&view=logs&j=4eda0b4a-bd0d-521a-0916-8285b9be9bb5&t=2ff6d5fa-53a6-53ac-bff7-fa524ea361a9] Multiple connect_1 stages fail due to a timeout: {code:java} Nov 09 02:09:33 "main" #1 prio=5 os_prio=0 tid=0x00007efd5400b800 nid=0x7c0e waiting on condition [0x00007efd5ccd8000] Nov 09 02:09:33 java.lang.Thread.State: WAITING (parking) Nov 09 02:09:33 at sun.misc.Unsafe.park(Native Method) Nov 09 02:09:33 - parking to wait for <0x00000000b762d130> (a java.util.concurrent.CompletableFuture$Signaller) Nov 09 02:09:33 at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175) Nov 09 02:09:33 at java.util.concurrent.CompletableFuture$Signaller.block(CompletableFuture.java:1707) Nov 09 02:09:33 at java.util.concurrent.ForkJoinPool.managedBlock(ForkJoinPool.java:3323) Nov 09 02:09:33 at java.util.concurrent.CompletableFuture.waitingGet(CompletableFuture.java:1742) Nov 09 02:09:33 at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.awaitPendingPartUploadToComplete(RecoverableMultiPartUploadImpl.java:233) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.awaitPendingPartsUpload(RecoverableMultiPartUploadImpl.java:223) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.snapshotAndGetRecoverable(RecoverableMultiPartUploadImpl.java:152) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.snapshotAndGetCommitter(RecoverableMultiPartUploadImpl.java:122) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.snapshotAndGetCommitter(RecoverableMultiPartUploadImpl.java:56) Nov 09 02:09:33 at org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.closeForCommit(S3RecoverableFsDataOutputStream.java:178) Nov 09 02:09:33 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testResumeAfterMultiplePersist(AbstractHadoopRecoverableWriterITCase.java:375) Nov 09 02:09:33 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testResumeAfterMultiplePersistWithMultiPartUploads(AbstractHadoopRecoverableWriterITCase.java:330) Nov 09 02:09:33 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testRecoverAfterMultiplePersistsStateWithMultiPart(AbstractHadoopRecoverableWriterITCase.java:318) [...]{code} And {code:java} Nov 09 01:53:59 "main" #1 prio=5 os_prio=0 cpu=3732.81ms elapsed=1707.61s tid=0x00007f7bec028000 nid=0x3e5 waiting on condition [0x00007f7bf2c80000] Nov 09 01:53:59 java.lang.Thread.State: WAITING (parking) Nov 09 01:53:59 at jdk.internal.misc.Unsafe.park(java.base@11.0.19/Native Method) Nov 09 01:53:59 - parking to wait for <0x00000000aff7e730> (a java.util.concurrent.CompletableFuture$Signaller) Nov 09 01:53:59 at java.util.concurrent.locks.LockSupport.park(java.base@11.0.19/LockSupport.java:194) Nov 09 01:53:59 at java.util.concurrent.CompletableFuture$Signaller.block(java.base@11.0.19/CompletableFuture.java:1796) Nov 09 01:53:59 at java.util.concurrent.ForkJoinPool.managedBlock(java.base@11.0.19/ForkJoinPool.java:3128) Nov 09 01:53:59 at java.util.concurrent.CompletableFuture.waitingGet(java.base@11.0.19/CompletableFuture.java:1823) Nov 09 01:53:59 at java.util.concurrent.CompletableFuture.get(java.base@11.0.19/CompletableFuture.java:1998) Nov 09 01:53:59 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.awaitPendingPartUploadToComplete(RecoverableMultiPartUploadImpl.java:233) Nov 09 01:53:59 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.awaitPendingPartsUpload(RecoverableMultiPartUploadImpl.java:223) Nov 09 01:53:59 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.snapshotAndGetRecoverable(RecoverableMultiPartUploadImpl.java:152) Nov 09 01:53:59 at org.apache.flink.fs.s3.common.writer.RecoverableMultiPartUploadImpl.snapshotAndGetRecoverable(RecoverableMultiPartUploadImpl.java:56) Nov 09 01:53:59 at org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.persist(S3RecoverableFsDataOutputStream.java:167) Nov 09 01:53:59 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testResumeAfterMultiplePersist(AbstractHadoopRecoverableWriterITCase.java:351) Nov 09 01:53:59 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testResumeAfterMultiplePersistWithMultiPartUploads(AbstractHadoopRecoverableWriterITCase.java:330) Nov 09 01:53:59 at org.apache.flink.runtime.fs.hdfs.AbstractHadoopRecoverableWriterITCase.testRecoverFromIntermWithoutAdditionalStateWithMultiPart(AbstractHadoopRecoverableWriterITCase.java:312) [...]{code} -- This message was sent by Atlassian Jira (v8.20.10#820010)