+1, totally agreed.

On Mon, Jun 22, 2015 at 5:32 PM, Gyula Fóra <[email protected]> wrote:
> Hey all,
> Currently we have reduce and aggregation methods for non-grouped
> DataStreams as well, which will produce local aggregates depending on the
> parallelism of the operator.
>
> This behaviour is neither intuitive nor useful as it only produces sensible
> results if the user specifically sets the parallelism to 1 which should not
> be encouraged.
>
> I would like to remove these methods from the DataStream api and only keep
> it for GroupedDataStreams and WindowedDataStream where the aggregation is
> either executed per-key or per-window.
>
> Cheers,
> Gyula
>
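
To illustrate the behaviour described in the quoted message, here is a minimal sketch in plain Python. It is a simulation, not Flink code: the function names and the round-robin routing of elements to parallel instances are assumptions made only for the example.

```python
from collections import defaultdict


def non_grouped_sum(values, parallelism):
    """Simulate a non-grouped sum: each parallel instance only sees its own share."""
    partials = [0] * parallelism
    for i, value in enumerate(values):
        # Assumption: elements are distributed round-robin across instances.
        partials[i % parallelism] += value
    return partials


def grouped_sum(records):
    """Simulate a per-key sum: all records with the same key are aggregated together."""
    totals = defaultdict(int)
    for key, value in records:
        totals[key] += value
    return dict(totals)


values = [1, 2, 3, 4, 5, 6]
print(non_grouped_sum(values, 1))  # one instance sees everything
print(non_grouped_sum(values, 2))  # two instances each produce a local aggregate
print(grouped_sum([("a", 1), ("b", 2), ("a", 3)]))
```

With a single instance the non-grouped sum is the global total, but with two instances it splits into two unrelated partial sums, whereas the per-key result does not change with the number of instances.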
