Re: Generate watermarks per key in a KeyedStream

2017-11-09 Thread Derek VerLee
We are contending with the same issue, as it happens.  We have dozens, and potentially down the line, may need to deal with thousands of different "time systems" as you put it, and may not be know at compile time or job start time.  In a practical sense, how

Re: Generate watermarks per key in a KeyedStream

2017-11-09 Thread Shailesh Jain
Thanks for your reply, Xingcan. On Wed, Nov 8, 2017 at 10:42 PM, Xingcan Cui wrote: > Hi Shailesh, > > actually, the watermarks are generated per partition, but all of them will > be forcibly aligned to the minimum one during processing. That is decided > by the semantics of

Re: Generate watermarks per key in a KeyedStream

2017-11-08 Thread Xingcan Cui
Hi Shailesh, actually, the watermarks are generated per partition, but all of them will be forcibly aligned to the minimum one during processing. That is decided by the semantics of watermark and KeyedStream, i.e., the watermarks belong to a whole stream and a stream is made up of different

Generate watermarks per key in a KeyedStream

2017-11-08 Thread Shailesh Jain
Hi, I'm working on implementing a use case wherein different physical devices are sending events, and due to network/power issues, there can be a delay in receiving events at Flink source. One of the operators within the flink job is the Pattern operator, and there are certain patterns which are

Re: Watermarks per key

2017-02-20 Thread jganoff
. So is that what's technically preventing assigning timestamps per key from working? I'm curious to hear Aljoscha's thoughts on watermark management across keys. Thanks! -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Watermarks-per-key

Re: Watermarks per key

2017-02-15 Thread Fabian Hueske
Hi Jordan, it is not possible to generate watermarks per key. This feature has been requested a couple of times but I think there are no plans to implement that. As far as I understand, the management of watermarks would be quite expensive (maintaining several watermarks, purging watermarks

Watermarks per key

2017-02-14 Thread Jordan Ganoff
Hi, I’m designing a streaming job whose elements need to be windowed by event time across a large set of keys. All elements are read from the same source. Event time progresses independently across keys. Is it possible to assign timestamps, and thus generate independent watermarks, per keyed

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-23 Thread Aljoscha Krettek
> AggregateReadings()); > > DataStream aggregatesPerHour = readings > .timeWindow(Time.hours(1)) > .allowedLateness(Time.hours(2)) > .apply(new ReadingAggregate(), new AggregateReadings(), new > AggregateReadings()); > > > > Feedback is

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-23 Thread kaelumania
DataStream aggregatesPerHour = readings >> .timeWindow(Time.hours(1)) >> .allowedLateness(Time.hours(2)) >> .apply(new ReadingAggregate(), new AggregateReadings(), new >> AggregateReadings()); >> >> >> Feedback is very welcome. >> >

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-23 Thread Aljoscha Krettek
t; I am pretty new to flink, could you elaborate on a possible solution? We > can guarantee good ordering by sensor_id, thus watermarking by key would be > the only reasonable way for us ( > *sensorData.keyBy('id').timeWindow(1.minute).sum('value')*), could I do > my own watermarking after*se

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-21 Thread Stephan Epping
han > > > > >> On 11 Nov 2016, at 00:29, Fabian Hueske-2 [via Apache Flink User Mailing >> List archive.] <[hidden email] >> <http://user/SendEmail.jtp?type=node=10179=0>> wrote: >> > >> Hi Stephan, >> >> I just wrote an

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-21 Thread Aljoscha Krettek
[via Apache Flink User Mailing > List archive.] <[hidden email] > <http:///user/SendEmail.jtp?type=node=10179=0>> wrote: > > Hi Stephan, > > I just wrote an answer to your SO question. > > Best, Fabian > > > 2016-11-10 11:01 GMT+01:00 Stephan Epping &l

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-17 Thread kaelumania
lt;[hidden email] > >: > Hello, > > I found this question in the Nabble archive > (http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Maintaining-watermarks-per-key-instead-of-per-operator-instance-tp7288.html > > <http://apache-flink-user-mailing

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-15 Thread Stephan Epping
t; about Snapshots? Will they work with the mentioned design? >> >> kind regards, >> Stephan > >>> On 11 Nov 2016, at 00:39, Fabian Hueske <>> href="x-msg://10/user/SendEmail.jtp?type=nodenode=10094i=1 >>> " >>> target=&qu

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-14 Thread Aljoscha Krettek
; On 11 Nov 2016, at 00:39, Fabian Hueske < href="x-msg://10/user/SendEmail.jtp?type=nodenode=10094i=1" > target="_top" rel="nofollow" link="external" class="">[hidden email]> wrote: > > Hi Stephan, > > I just wrote an answer

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-14 Thread kaelumania
6-11-10 11:01 GMT+01:00 Stephan Epping <[hidden email] >> >: >> Hello, >> >> I found this question in the Nabble archive >> (http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Maintaining-watermarks-per-key-instead-of-per-operator-instance-tp7

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-14 Thread Fabian Hueske
1:00 Stephan Epping <stephan.epp...@zweitag.de>: > >> Hello, >> >> I found this question in the Nabble archive ( >> http://apache-flink-user-mailing-list-archive.2336050.n4. >> nabble.com/Maintaining-watermarks-per-key-instead-of-per- >> operator-insta

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-13 Thread Stephan Epping
n. > > Best, Fabian > > 2016-11-10 11:01 GMT+01:00 Stephan Epping <stephan.epp...@zweitag.de > <mailto:stephan.epp...@zweitag.de>>: > Hello, > > I found this question in the Nabble archive > (http://apache-flink-user-mailing-list-archive.2336050.n4.nabbl

Re: Maintaining watermarks per key, instead of per operator instance

2016-11-10 Thread Fabian Hueske
ning- > watermarks-per-key-instead-of-per-operator-instance-tp7288.html) but was > unable/dont know how to reply. > > Here is my question regarding the mentioned thread: > > Hello, > > I have similar requirements (see StackOverflor http:// > stackoverflow.com/quest

Re: Maintaining watermarks per key, instead of per operator instance

2016-06-02 Thread Kanstantsin Kamkou
Hi Aljoscha! Is it possible somehow to use the RichXFunction in CEP? The task is pretty similar, but I have to ignore once the next triggered event for the same key. On Wed, Jun 1, 2016 at 2:54 PM, Aljoscha Krettek wrote: > Hi, > yeah, in that case per-key watermarks would

Re: Maintaining watermarks per key, instead of per operator instance

2016-06-02 Thread leon_mclare
Hi again Aljoscha, understood. Thanks for the link. I really like the straightforward approach concerning storing state. It makes things very easy. The improvements are very interesting, particularly the composite triggers. That would significantly improve flexibility. Kind regards Leon 1.

Re: Maintaining watermarks per key, instead of per operator instance

2016-06-01 Thread Aljoscha Krettek
Hi, yeah, in that case per-key watermarks would be useful for you. I won't be possible to add such a feature, though, due to the (possibly) dynamic nature of the key space and how watermark tracking works. You should be able to implement it with relatively low overhead using a RichFlatMapFunction

Re: Maintaining watermarks per key, instead of per operator instance

2016-05-31 Thread leon_mclare
Hi Aljoscha, thanks for the speedy reply. I am processing measurements delivered by smart meters. I use windows to gather measurements and calculate values such as average consumption. The key is simply the meter ID. The challenge is that meters may undergo network partitioning, under which

Maintaining watermarks per key, instead of per operator instance

2016-05-31 Thread leon_mclare
My use case primarily concerns applying transformations per key, with the keys remaining fixed throughout the topology. I am using event time for my windows. The problem i am currently facing is that watermarks in windows propagate per operator instance, meaning the operator event time