[ https://issues.apache.org/jira/browse/FLINK-7001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16464248#comment-16464248 ]
Philipp Grulich commented on FLINK-7001: ---------------------------------------- Hi [~walterddr] , we published recently this paper about our new Window Operator: http://www.user.tu-berlin.de/powibol/assets/publications/traub-scotty-icde-2018.pdf It would definitely provide a huge performance improvement in contrast to the current Flink implementation. I think a FLIP was not written yet. Best, Philipp > Improve performance of Sliding Time Window with pane optimization > ----------------------------------------------------------------- > > Key: FLINK-7001 > URL: https://issues.apache.org/jira/browse/FLINK-7001 > Project: Flink > Issue Type: Improvement > Components: DataStream API > Reporter: Jark Wu > Assignee: Jark Wu > Priority: Major > > Currently, the implementation of time-based sliding windows treats each > window individually and replicates records to each window. For a window of 10 > minute size that slides by 1 second the data is replicated 600 fold (10 > minutes / 1 second). We can optimize sliding window by divide windows into > panes (aligned with slide), so that we can avoid record duplication and > leverage the checkpoint. > I will attach a more detail design doc to the issue. > The following issues are similar to this issue: FLINK-5387, FLINK-6990 -- This message was sent by Atlassian JIRA (v7.6.3#76005)