[ https://issues.apache.org/jira/browse/FLINK-7001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16161075#comment-16161075 ]
Philipp Grulich commented on FLINK-7001:
----------------------------------------

Hey,

what is the current state of this?
Over the last month, I worked on a similar research project. We implemented this kind of aggregation sharing for all time-based window operators.
Maybe it would make sense to work together on this.

Best,
Philipp

> Improve performance of Sliding Time Window with pane optimization
> -----------------------------------------------------------------
>
>                 Key: FLINK-7001
>                 URL: https://issues.apache.org/jira/browse/FLINK-7001
>             Project: Flink
>          Issue Type: Improvement
>          Components: DataStream API
>            Reporter: Jark Wu
>            Assignee: Jark Wu
>             Fix For: 1.4.0
>
>
> Currently, the implementation of time-based sliding windows treats each
> window individually and replicates records to each window. For a window of
> 10 minutes that slides by 1 second, the data is replicated 600-fold
> (10 minutes / 1 second). We can optimize sliding windows by dividing them
> into panes (aligned with the slide), so that we avoid record duplication and
> leverage the checkpoint.
> I will attach a more detailed design doc to the issue.
> The following issues are similar to this issue: FLINK-5387, FLINK-6990
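
To make the pane idea in the issue description concrete, here is a minimal sketch in plain Java. It is not Flink's window operator and not the design proposed in FLINK-7001. The class `PaneSlidingSum` and all of its methods are hypothetical names chosen for illustration. The sketch assumes a sum aggregation, timestamps in milliseconds, a window size that is an exact multiple of the slide, and window starts aligned to the slide.

```java
import java.util.TreeMap;

/**
 * Hypothetical illustration of pane-based sliding-window aggregation.
 * Each record is added to exactly one pane (pane length = slide), and a
 * window result is obtained by merging the panes that the window covers.
 */
final class PaneSlidingSum {
    private final long size;   // window size in milliseconds
    private final long slide;  // slide in milliseconds (also the pane length)

    // Pane start timestamp -> partial sum of all records in that pane.
    private final TreeMap<Long, Long> panes = new TreeMap<>();

    PaneSlidingSum(long size, long slide) {
        // Assumption of this sketch: the slide divides the window size evenly.
        if (slide <= 0 || size <= 0 || size % slide != 0) {
            throw new IllegalArgumentException("size must be a positive multiple of slide");
        }
        this.size = size;
        this.slide = slide;
    }

    /** Number of windows each record belongs to, i.e. copies made without panes. */
    long replicationFactor() {
        return size / slide;
    }

    /** Adds a record to the single pane that contains its timestamp. */
    void add(long timestamp, long value) {
        long paneStart = Math.floorDiv(timestamp, slide) * slide;
        panes.merge(paneStart, value, Long::sum);
    }

    /** Sum over the window [windowStart, windowStart + size), merged from its panes. */
    long windowSum(long windowStart) {
        if (Math.floorMod(windowStart, slide) != 0) {
            throw new IllegalArgumentException("windowStart must be aligned with the slide");
        }
        long sum = 0;
        for (long partial : panes.subMap(windowStart, true, windowStart + size, false).values()) {
            sum += partial;
        }
        return sum;
    }

    /** Number of panes currently held, for comparison with the replication factor. */
    int storedPanes() {
        return panes.size();
    }
}
```

With `new PaneSlidingSum(600_000L, 1_000L)`, `replicationFactor()` returns 600, matching the 600-fold figure in the description, while each call to `add` touches only one pane. The trade-off in this sketch is that firing a window merges up to 600 partial sums; keeping pre-merged results would be a further refinement not shown here.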