Re: How do I ensure binary comparisons are being used?

2017-01-02 Thread ljwagerfield
Any insights on this? Thanks, Lawrence -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/How-do-I-ensure-binary-comparisons-are-being-used-tp10806p10819.html Sent from the Apache Flink User Mailing List archive. mailing list archive at

Re: Set Parallelism and keyBy

2017-01-02 Thread Jamie Grier
Domink, This should work just as you expect. Maybe the output of the print is just misleading you. The print() operation will still have a parallelism of two but the flatMap() with have a parallelism of 16 and all data elements with the same key will get routed to the same host. Any sequence

Re: Flink streaming questions

2017-01-02 Thread Jamie Grier
Hi Henri, #1 - This is by design. Event time advances with the slowest input source. If there are input sources that generate no data this is indistinguishable from a slow source. Kafka topics where some partitions receive no data are a problem in this regard -- but there isn't a simple

Re: Programmatically get live values of accumulators

2017-01-02 Thread Jamie Grier
Hi Gwenhael, I think what you actually want is to use the Apache Flink metrics interface. See the following: https://ci.apache.org/projects/flink/flink-docs-release-1.1/apis/metrics.html Sending metrics to StatsD is supported out-of-the-box. -Jamie On Mon, Jan 2, 2017 at 1:34 AM, Gwenhael

Re: Hi, There is possibly an issue with EventTimeSessionWindows where a gap is specified for considering items in the same session. Here the logic is, if two adjacent items have a difference in event

2017-01-02 Thread Jamie Grier
If there is never a gap between elements larger than the session gap -- the window never ending would be the correct behavior. So, if this is the case with some data stream I would not suggest to use session windows at all -- or I would use a smaller session gap. Another alternative would be to

1.1.4 IntelliJ Problem

2017-01-02 Thread Stephan Epping
Hi, I am getting this error running my tests with 1.1.4 inside intellij ide. java.lang.NoSuchMethodError:

Re: Events are assigned to wrong window

2017-01-02 Thread Nico
Hi Aljoscha, thank you for having a look. Actually there is not too much code based on timestamps: stream .keyBy("id") .map(...) .filter(...) .map(...) .keyBy("areaID") .map(new KeyExtractor()) .keyBy("f1.areaID","f0.sinterval")

Hi, There is possibly an issue with EventTimeSessionWindows where a gap is specified for considering items in the same session. Here the logic is, if two adjacent items have a difference in event time

2017-01-02 Thread Sujit Sakre
Hi, We are using Flink 1.1.4 version. There is possibly an issue with EventTimeSessionWindows where a gap is specified for considering items in the same session. Here the logic is, if two adjacent items have a difference in event timestamps of more than the gap then the items are considered to

Programmatically get live values of accumulators

2017-01-02 Thread Gwenhael Pasquiers
Hi, and best wishes for the year to come :) I'd like to be able to programmatically get the (live) values of accumulators in order to send them using a statsd (or another) client in the JobManager of a yarn-deployed application. I say live because I'd like to use that in streaming (24/7)

Flink streaming questions

2017-01-02 Thread Henri Heiskanen
Hi, I have few questions related to Flink streaming. I am on 1.2-SNAPSHOT and what I would like to accomplish is to have a stream that reads data from multiple kafka topics, identifies user sessions, uses an external user user profile to enrich the data, evaluates an script to produce session