Re: How do network transmissions in Flink work?

2015-07-20 Thread Niklas Semmler
Hello Ufuk, thank you very much for the answer. You helped me bring a great deal of context to the problem :). I have one final question: What is a good indicator that the transfer of data contained in a single ResultPartition is finished? Is there one? Or can the amount of retrieved da

Re: Communicating between nodes at runtime

2015-07-20 Thread Sachin Goel
@Max, broadcast variables have to be declared before the program is executed. I want to be able to do something whereby I can send data inside a map operation to all nodes. This would perhaps have an effect similar to the recent discussion on making Accumulators available before Job completion. [I

[jira] [Created] (FLINK-2382) Live Metric Reporting Does Not Work for Two-Input StreamTasks

2015-07-20 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-2382: --- Summary: Live Metric Reporting Does Not Work for Two-Input StreamTasks Key: FLINK-2382 URL: https://issues.apache.org/jira/browse/FLINK-2382 Project: Flink

[jira] [Created] (FLINK-2381) Possible class not found Exception on failed partition producer

2015-07-20 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-2381: -- Summary: Possible class not found Exception on failed partition producer Key: FLINK-2381 URL: https://issues.apache.org/jira/browse/FLINK-2381 Project: Flink Is

Re: Communicating between nodes at runtime

2015-07-20 Thread Stephan Ewen
You are probably looking for a parameter server tool. How about setting up one of these memory grids to use that? Apache Ignite, or Apache Geode, or one of those. On Fri, Jul 17, 2015 at 6:19 PM, Sachin Goel wrote: > Hi all > Is it possible to send and receive key,value pairs at runtime? I woul

Re: Communicating between nodes at runtime

2015-07-20 Thread Maximilian Michels
Hi Sachin, Do you know about Broadcast Variables? They allow you to transfer a DataSet to all nodes. https://ci.apache.org/projects/flink/flink-docs-master/apis/programming_guide.html#broadcast-variables Let us know if that fits your needs. Cheers, Max On Fri, Jul 17, 2015 at 6:19 PM, Sachin G

[jira] [Created] (FLINK-2380) Allow to configure default FS for file inputs

2015-07-20 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-2380: -- Summary: Allow to configure default FS for file inputs Key: FLINK-2380 URL: https://issues.apache.org/jira/browse/FLINK-2380 Project: Flink Issue Type: Improveme

Re: Benchmarks of Flink, supporting Flink in BigDataBench

2015-07-20 Thread Fabian Hueske
Hi, welcome to the Flink community and thanks for including Flink in your benchmark suite! That's really exciting news :-) Most of the jobs that you listed in your preliminary plan are available as example programs in Flink's code base [1]. However, you should know that these examples are NOT

Re: Benchmarks of Flink, supporting Flink in BigDataBench

2015-07-20 Thread Stephan Ewen
Hi! Thanks for reaching out and adding Flink to BigDataBench. The plan you sent looks like a nice first draft. It is pretty much batch jobs. Here are a few ideas for what you could add as batch jobs: - Joins are something people seem to do a lot with these systems, so a 2-3 table join would be a nice

Benchmarks of Flink, supporting Flink in BigDataBench

2015-07-20 Thread Xinhui Tian
Hello, everyone. I'm a PhD student from the Institute of Computing Technology, Chinese Academy of Sciences. Our team has released a benchmark for big data systems called BigDataBench, which has become an industry-standard big data benchmark in China. You can find our work on this website: http://p