rkhachatryan opened a new pull request #12478:
URL: https://github.com/apache/flink/pull/12478


   __This is a draft PR. It depends on #12457 and should not be merged before it.__
   
   ## What is the purpose of the change
   
   Before FLINK-17218 there were two problems:
   1. Exceeding `ChannelStateWriterImpl.maxCheckpoints`
   1. A race condition with out-of-order barriers (see commit 
be8fbcf506b8dd38e5425cf772a55f033f0962b0 of this PR)
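
To illustrate the first problem, here is a minimal sketch, assuming the writer keeps a bounded registry of in-flight checkpoints and rejects a new one once the limit is reached. `BoundedWriterSketch` and its methods are hypothetical stand-ins, not the actual `ChannelStateWriterImpl` code.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical stand-in for a writer with a bounded number of in-flight checkpoints. */
final class BoundedWriterSketch {
    private final int maxCheckpoints;
    private final Set<Long> inFlight = new HashSet<>();

    BoundedWriterSketch(int maxCheckpoints) {
        this.maxCheckpoints = maxCheckpoints;
    }

    /** Registers a checkpoint; fails if too many earlier ones were never finished or aborted. */
    void start(long checkpointId) {
        if (inFlight.size() >= maxCheckpoints) {
            throw new IllegalStateException(
                "Too many in-flight checkpoints: " + inFlight.size());
        }
        inFlight.add(checkpointId);
    }

    /** Frees the slot held by a checkpoint, for example when it is subsumed. */
    void abort(long checkpointId) {
        inFlight.remove(checkpointId);
    }
}
```

In this sketch, checkpoints that are never aborted keep their slots, so a later `start` fails even though the older checkpoints are no longer relevant.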
   
   FLINK-17218 (24ff415f1b76392f75dea7c3538558d24fcb7058) fixed the first one 
but introduced a new race condition: the Netty thread modifies a 
`ChannelStateWriter` map that is also used by the task thread.
   
   This PR reverts FLINK-17218 and addresses the issues above.
   
   ## Brief change log
   1. Revert 24ff415f1b76392f75dea7c3538558d24fcb7058 
`[FLINK-17218][checkpointing] Ensuring that ChannelStateWriter aborts previous 
checkpoints before a new checkpoint`
   1. Abort the channel state write if the checkpoint is subsumed, using a 
future callback that runs on the task thread (fixes the race condition)
   1. Increase `ChannelStateWriterImpl.DEFAULT_MAX_CHECKPOINTS` (fixes the 
original problem of `FLINK-17218`)
   1. Ignore out-of-order checkpoints in `SubtaskCheckpointCoordinator` (fixes 
a more subtle race condition)
   1. Unignore `UnalignedCheckpointITCase`
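
The second and fourth items can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in this PR: `CheckpointOrderSketch`, its `mailbox` queue, and the `subsumed` future are hypothetical names. The callback may fire on any thread, so it only enqueues work; the map itself is touched by the task thread alone.

```java
import java.util.HashSet;
import java.util.Queue;
import java.util.Set;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentLinkedQueue;

/** Hypothetical stand-in for a per-subtask coordinator; not Flink's actual class. */
final class CheckpointOrderSketch {
    // Accessed only from the task thread, so a plain HashSet is sufficient.
    private final Set<Long> pendingWrites = new HashSet<>();
    // Thread-safe hand-off queue, standing in for the task's mailbox.
    private final Queue<Runnable> mailbox = new ConcurrentLinkedQueue<>();
    private long lastCheckpointId = -1;

    /** Called on the task thread. Returns false if the checkpoint is out of order. */
    boolean startCheckpoint(long checkpointId, CompletableFuture<Void> subsumed) {
        if (checkpointId <= lastCheckpointId) {
            return false; // an older barrier arrived late: ignore it
        }
        lastCheckpointId = checkpointId;
        pendingWrites.add(checkpointId);
        // The callback does not touch pendingWrites directly; it defers the
        // abort to the task thread by enqueueing it.
        subsumed.thenRun(() -> mailbox.add(() -> pendingWrites.remove(checkpointId)));
        return true;
    }

    /** Called on the task thread to run deferred actions. */
    void drainMailbox() {
        Runnable action;
        while ((action = mailbox.poll()) != null) {
            action.run();
        }
    }

    int pendingCount() {
        return pendingWrites.size();
    }
}
```

Routing the abort through the queue keeps all map mutations on one thread, which is the property the reverted change lost.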
   
   ## Verifying this change
   
   This change is verified by re-enabling `UnalignedCheckpointITCase`.
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): no
     - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: no
     - The serializers: no
     - The runtime per-record code paths (performance sensitive): no
     - Anything that affects deployment or recovery: JobManager (and its 
components), Checkpointing, Kubernetes/Yarn/Mesos, ZooKeeper: no
     - The S3 file system connector: no
   
   ## Documentation
   
     - Does this pull request introduce a new feature? no
     - If yes, how is the feature documented? not applicable
   



