Etienne Chauchot created FLINK-22587:
----------------------------------------

             Summary: Support aggregations in batch mode with DataStream API
                 Key: FLINK-22587
                 URL: https://issues.apache.org/jira/browse/FLINK-22587
             Project: Flink
          Issue Type: Improvement
          Components: API / DataStream
            Reporter: Etienne Chauchot


A pipeline like this *in batch mode* would output no data
{code:java}
stream.join(otherStream)
    .where(<KeySelector>)
    .equalTo(<KeySelector>)
    .window(GlobalWindows.create())
    .apply(<JoinFunction>)
{code}
Indeed the default trigger for GlobalWindow is NeverTrigger which never fires. 
If we set a _EventTimeTrigger_ it will fire with every element as the watermark 
will be set to +INF (batch mode) and will pass the end of the global window 
with each new element. A _ProcessingTimeTrigger_ never fires either and all 
elapsed time or delta based triggers would not be suited for batch.

Same goes for _reduce()_ instead of join().

So I guess we miss something for batch support with DataStream.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to