[jira] [Commented] (FLINK-9428) Allow operators to flush data on checkpoint pre-barrier

ASF GitHub Bot (JIRA) Thu, 24 May 2018 02:03:04 -0700

    [ https://issues.apache.org/jira/browse/FLINK-9428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16488679#comment-16488679 ]


ASF GitHub Bot commented on FLINK-9428:
---------------------------------------

Github user aljoscha commented on a diff in the pull request:

    https://github.com/apache/flink/pull/6066#discussion_r190510791
  
    --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/api/operators/StreamOperator.java ---
    @@ -93,6 +93,20 @@
        //  state snapshots
        // ------------------------------------------------------------------------
     
    +   /**
    +    * This method is called when the operator should do a snapshot, before it emits its
    +    * own checkpoint barrier. This method is intended not for any actual state persistence,
    +    * but only for emitting some data before emitting the checkpoint barrier.
    +    *
    +    * <p><b>Important:</b> This method should not be used for any actual state snapshot logic, because
    +    * it will inherently be within the synchronous part of the operator's checkpoint. If heavy work is done
    +    * withing this method, it will affect latency and downstream checkpoint alignments.
    --- End diff --
    
    typo: withing -> within


> Allow operators to flush data on checkpoint pre-barrier
> -------------------------------------------------------
>
>                 Key: FLINK-9428
>                 URL: https://issues.apache.org/jira/browse/FLINK-9428
>             Project: Flink
>          Issue Type: New Feature
>          Components: State Backends, Checkpointing
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>            Priority: Major
>             Fix For: 1.6.0
>
>
> Some operators maintain small transient state that may be inefficient to 
> checkpoint, especially when it would also need to be checkpointed in a 
> re-scalable way.
> An example is opportunistic pre-aggregation operators, which hold a small 
> pre-aggregation state that is frequently flushed downstream.
> Rather than persisting that state in a checkpoint, it can make sense to flush 
> the data downstream upon a checkpoint, to let it be part of the downstream 
> operator's state.
> This feature is sensitive, because flushing state has a clear implication on 
> the downstream operator's checkpoint alignment. However, used with care, and 
> with the new back-pressure-based checkpoint alignment, this feature can be 
> very useful.
> Because it is sensitive, I suggest making this only an internal feature 
> (accessible to operators) and NOT exposing it in the public API at this point.
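
The flush-before-barrier idea described above can be sketched in plain Java. This is only an illustration and uses no Flink classes: `Output`, `PreAggregator`, `flushBeforeBarrier`, and `triggerCheckpoint` are hypothetical names invented for this sketch, not the interface added by the pull request. The assumption is that the hook runs synchronously right before the barrier is emitted, so it should only do cheap work such as draining a small buffer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Plain-Java simulation of flushing on the pre-barrier hook. All names are hypothetical. */
final class PreBarrierFlushSketch {

    /** Stand-in for the downstream channel: records and barriers in emission order. */
    static final class Output {
        final List<String> events = new ArrayList<>();

        void emitRecord(String key, long sum) {
            events.add(key + "=" + sum);
        }

        void emitBarrier(long checkpointId) {
            events.add("BARRIER(" + checkpointId + ")");
        }
    }

    /** Opportunistic pre-aggregator holding small, transient per-key sums. */
    static final class PreAggregator {
        // TreeMap keeps keys sorted so the flush order is deterministic.
        private final Map<String, Long> buffer = new TreeMap<>();
        private final Output out;

        PreAggregator(Output out) {
            this.out = out;
        }

        void processElement(String key, long value) {
            buffer.merge(key, value, Long::sum);
        }

        /** Hypothetical pre-barrier hook: push buffered sums downstream, then clear. */
        void flushBeforeBarrier(long checkpointId) {
            buffer.forEach(out::emitRecord);
            buffer.clear();
        }

        /** Number of keys that would still have to be persisted in a snapshot. */
        int bufferedKeys() {
            return buffer.size();
        }
    }

    /** Simulated checkpoint: flush first, then emit the barrier. */
    static void triggerCheckpoint(PreAggregator op, Output out, long checkpointId) {
        op.flushBeforeBarrier(checkpointId);
        out.emitBarrier(checkpointId);
        // A state snapshot would follow here; the transient buffer is already empty.
    }

    public static void main(String[] args) {
        Output out = new Output();
        PreAggregator op = new PreAggregator(out);
        op.processElement("a", 1);
        op.processElement("b", 5);
        op.processElement("a", 2);
        triggerCheckpoint(op, out, 7L);
        System.out.println(out.events); // prints [a=3, b=5, BARRIER(7)]
    }
}
```

Because the flushed records precede the barrier, they belong to the downstream operator's state for that checkpoint, and the pre-aggregator itself has nothing left to persist. The cost is that a large flush delays the barrier, which is the alignment concern raised in the description.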



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
