Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Jark Wu
It’s a great idea! I would be happy to help with something. In addition, maybe we can move the full “Powered By” wiki page to the website to reduce external links. - Jark Wu > On July 4, 2016, at 11:15 PM, Stephan Ewen <se...@apache.org> wrote: > > We have the "Powered By" page already. > My na

Issues while interacting with DynamoDB

2016-07-04 Thread Deepak Jha
Hi All, We have a Flink (1.0.2) HA setup on AWS and are using IAM roles to interact with S3 (S3a, as suggested in the Flink best practices) and DynamoDB. While trying to interact with DynamoDB to perform a key-value lookup from one of the operators, we are running into the following issue. def putI

Re: [DISCUSS] Releasing Flink 1.1.0

2016-07-04 Thread Robert Metzger
+1 to doing an RC0 this week, but forking off master with RC1. I would like to reduce the time we need to apply patches to multiple branches. @Aljoscha: I was running into the same issue on EMR when I used Flink w/ RocksDB recently, so I agree ;) On Mon, Jul 4, 2016 at 3:35 PM, Aljoscha Krettek wr

[jira] [Created] (FLINK-4150) Problem with Blobstore in Yarn HA setting on recovery after cluster shutdown

2016-07-04 Thread Stefan Richter (JIRA)
Stefan Richter created FLINK-4150: Summary: Problem with Blobstore in Yarn HA setting on recovery after cluster shutdown Key: FLINK-4150 URL: https://issues.apache.org/jira/browse/FLINK-4150 Project:

[Discuss] Query regarding Flink 1.1 setup

2016-07-04 Thread Vinay Patil
Hi, Can you please tell me how to download Flink 1.1-SNAPSHOT for running the job on a cluster? On the Flink site, the latest version I can get is 1.0.3 (which I am currently using). Is there any other link where I can get 1.1? Regards, Vinay Patil

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Vinay Patil
Thanks a lot guys, this helps me understand better. Regards, Vinay Patil On Mon, Jul 4, 2016 at 8:43 PM, Stephan Ewen wrote: > Just to be sure: Each *subtask* has one thread - so for each task, there > are as many parallel threads (distributed across nodes) as your parallelism > indicates. > > F

Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Kostas Tzoumas
I'd be happy to check with some of these folks. On Mon, Jul 4, 2016 at 8:15 AM, Stephan Ewen wrote: > We have the "Powered By" page already. > My naive assumption was that companies that have clearance to put their > name there are also okay with a logo. > After all, it is only displaying the sam

Re: DagConnection tempMode vs breakPipeline property

2016-07-04 Thread CPC
Thank you, Ufuk. Yes, it helps a lot. Thanks for your help. On Jul 4, 2016 2:02 PM, "Ufuk Celebi" wrote: > The data exchange mode has been introduced recently as a replacement > for the pipeline break logic, which was buggy. I'm not too familiar > with the optimizer, but I would expect everythi

Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Stephan Ewen
We have the "Powered By" page already. My naive assumption was that companies that have clearance to put their name there are also okay with a logo. After all, it is only displaying the same information in a more prominent place. On Mon, Jul 4, 2016 at 5:06 PM, Márton Balassi wrote: > I do like

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Stephan Ewen
Just to be sure: Each *subtask* has one thread - so for each task, there are as many parallel threads (distributed across nodes) as your parallelism indicates. For most cases, having long chains and then a higher parallelism is a good choice. Cases where individual functions (MapFunction, etc) do
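
The thread counts described in this message can be pictured with a small toy calculation. This is only a sketch under simplifying assumptions that are not stated in the thread: every task runs with the same parallelism, and each chain of operators becomes exactly one task. The function name and the operator lists are made up for illustration and are not Flink APIs.

```python
def count_subtask_threads(chains, parallelism):
    """Return the number of subtask threads for a job.

    chains: list of operator chains; each chain (a list of operator names)
            is assumed to run as one task.
    parallelism: number of parallel subtasks per task (assumed uniform).
    """
    if parallelism < 1:
        raise ValueError("parallelism must be at least 1")
    # One thread per subtask, and `parallelism` subtasks per task.
    return len(chains) * parallelism


# Hypothetical pipeline: all four operators chained into a single task.
chained = [["source", "map", "filter", "sink"]]
# Same operators with chaining disabled: every operator is its own task.
unchained = [["source"], ["map"], ["filter"], ["sink"]]

print(count_subtask_threads(chained, 8))    # 1 task  x 8 subtasks
print(count_subtask_threads(unchained, 8))  # 4 tasks x 8 subtasks
```

In this model, longer chains mean fewer threads and fewer hand-offs between them, so a higher parallelism is what spreads the work across nodes.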

Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Márton Balassi
I do like the idea, that seems to be the trend now - the Bigtop community had a similar initiative recently. [1] Helps dealing with the "Is it mature enough?" question. :) [1] http://kaiyzen.github.io/bigtop/ On Mon, Jul 4, 2016 at 5:00 PM, Ufuk Celebi wrote: > I would like that! +1 > > On Mon,

Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Ufuk Celebi
I would like that! +1 On Mon, Jul 4, 2016 at 4:59 PM, Aljoscha Krettek wrote: > Hi, > If we have some high-profile users that are worthwhile putting there and that > are OK with us putting up their logos then this would be great. > > Cheers, > Aljoscha > > On Mon, 4 Jul 2016 at 16:58 Stephan Ewen

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Aljoscha Krettek
Hi, chaining is useful to minimize communication overhead. But in your case you might benefit more from having good cluster utilization. There seems to be a tradeoff. Maybe you can run some easy tests to see how it behaves for you. Cheers, Aljoscha On Mon, 4 Jul 2016 at 16:28 Vinay Patil wrote:

Re: [DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Aljoscha Krettek
Hi, If we have some high-profile users that are worthwhile putting there and that are OK with us putting up their logos then this would be great. Cheers, Aljoscha On Mon, 4 Jul 2016 at 16:58 Stephan Ewen wrote: > Hi all! > > I was wondering if we want to put some names / logos of Flink users on t

[DISCUSS] Putting Flink user names / logos on the homepage

2016-07-04 Thread Stephan Ewen
Hi all! I was wondering if we want to put some names / logos of Flink users on the page. A bunch of Apache projects do that, for example, have a look at the Storm home page (http://storm.apache.org). From that section of users/logos, we would link to the "Powered By" section in the wiki. That woul

[jira] [Created] (FLINK-4149) Fix Serialization of NFA in AbstractKeyedCEPPatternOperator

2016-07-04 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-4149: Summary: Fix Serialization of NFA in AbstractKeyedCEPPatternOperator Key: FLINK-4149 URL: https://issues.apache.org/jira/browse/FLINK-4149 Project: Flink

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Vinay Patil
Thanks. So is operator chaining useful in terms of utilizing the resources, or should we keep chaining to a minimum, say 3-4 operators, and disable it otherwise? I am worried because I am seeing all the operators in one box on the Flink UI. Regards, Vinay Patil On Mon, Jul 4, 2016 at 7:13 PM, Aljo

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Aljoscha Krettek
Hi, this is true, yes. If the number of Kafka partitions is less than the parallelism then some of the sources might not be utilized. If you insert a rebalance after the sources you should be able to utilize all the downstream operations equally. Cheers, Aljoscha On Mon, 4 Jul 2016 at 11:13 Vinay
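
The effect described in this message can be illustrated with a toy model in plain Python. It is a sketch only, not Flink code: the round-robin assignment of partitions to source subtasks is an assumption made for illustration, and the numbers (3 partitions, parallelism 8, 800 records per partition) are invented.

```python
def assign_partitions(num_partitions, source_parallelism):
    """Assign partitions to source subtasks round-robin (illustrative assumption)."""
    assignment = [[] for _ in range(source_parallelism)]
    for partition in range(num_partitions):
        assignment[partition % source_parallelism].append(partition)
    return assignment


def rebalance(records_per_source, downstream_parallelism):
    """Spread each source subtask's records round-robin over all downstream subtasks."""
    received = [0] * downstream_parallelism
    for record_count in records_per_source:
        for i in range(record_count):
            received[i % downstream_parallelism] += 1
    return received


# Fewer partitions than source subtasks: some sources get nothing to read.
assignment = assign_partitions(num_partitions=3, source_parallelism=8)
idle_sources = sum(1 for partitions in assignment if not partitions)
print(idle_sources)

# After a rebalance step, every downstream subtask receives an equal share.
records_per_source = [800 * len(partitions) for partitions in assignment]
print(rebalance(records_per_source, downstream_parallelism=8))
```

In this model the idle source subtasks stay idle, but the work after the rebalance step is spread evenly, which matches the point made above about downstream operations.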

Re: [DISCUSS] Releasing Flink 1.1.0

2016-07-04 Thread Aljoscha Krettek
IMHO, the fix for this should also go in: https://issues.apache.org/jira/browse/FLINK-4115. This is blocking for users that want to use the RocksDB backend or FsStateBackend on Amazon EMR with S3. There is already an open PR that I'm hoping to get in this week. On Mon, 4 Jul 2016 at 13:48 Ufuk Ce

[jira] [Created] (FLINK-4148) incorrect calculation distance in QuadTree

2016-07-04 Thread Alexey Diomin (JIRA)
Alexey Diomin created FLINK-4148: Summary: incorrect calculation distance in QuadTree Key: FLINK-4148 URL: https://issues.apache.org/jira/browse/FLINK-4148 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-4147) Consider moving the file sources from the StreamExecutionEnvironment to the flink-connector-filesystem

2016-07-04 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-4147: Summary: Consider moving the file sources from the StreamExecutionEnvironment to the flink-connector-filesystem Key: FLINK-4147 URL: https://issues.apache.org/jira/browse/FLINK-

Re: [DISCUSS] Releasing Flink 1.1.0

2016-07-04 Thread Ufuk Celebi
Thanks for the feedback. I would like to create a preview RC0 this week like we did for the last releases, too. In past major releases, we needed to create many release candidates, often for fixing just some small issues. I would like to speed up the release process by collecting as many issues as

Re: DagConnection tempMode vs breakPipeline property

2016-07-04 Thread Ufuk Celebi
The data exchange mode has been introduced recently as a replacement for the pipeline break logic, which was buggy. I'm not too familiar with the optimizer, but I would expect everything that goes back to the DataExchangeMode to be correct. The rest should be an artifact of the old pipeline breaker

Re: Exception but successfully build

2016-07-04 Thread Ufuk Celebi
Hey David, could this be related: http://stackoverflow.com/questions/1124788/java-unresolved-compilation-problem? – Ufuk On Mon, Jul 4, 2016 at 9:22 AM, David Herzog wrote: > Dear Support, > > I make small print outs in: org.apache.flink.runtime.jobmanager.Jobmanager > to better understand how it

Re: [Discussion] Query Regarding Operator chaining

2016-07-04 Thread Vinay Patil
Just an update: the task will be executed by multiple threads. My bad, I asked the wrong way. Can you please clarify the other things? Out of 8 nodes, only 3 are being utilized for reading the data from Kafka. Does this mean that the number of Kafka partitions is set too low? What if we use rescal

Exception but successfully build

2016-07-04 Thread David Herzog
Dear Support, I added some small print statements in org.apache.flink.runtime.jobmanager.Jobmanager to better understand how it works. Since building takes a lot of time, I run this command to save some time: mvn package -Dmaven.test.skip=true After successfully building Flink, I try to run: bin/flink r