Flink on AWS EMR unable to access S3 files

2016-07-14 Thread Ashish Aggarwal
Hi Flink Devs, I am facing a wierd issue with my Flink Job that uses DataSet APIs to read parquet files from s3. I have Flink Cluster deployed on AWS EMR and my jar works with charm when gets submitted through JobManager UI. However I need a programatic way to submit the jars, hence I use flink-cl

[DISCUSS] Move JIRA creation emails to separate list?

2016-07-14 Thread Theodore Vasiloudis
Hello all, I'm not sure if this has been discussed before (or if this is a an option when joining the list in which case ignore this) but I would like to suggest moving all the issue creation emails to a list separate from dev. Right now if I come back to the list after say a week most of the ac

[jira] [Created] (FLINK-4219) Quote PDSH opts in start-cluster.sh

2016-07-14 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-4219: - Summary: Quote PDSH opts in start-cluster.sh Key: FLINK-4219 URL: https://issues.apache.org/jira/browse/FLINK-4219 Project: Flink Issue Type: Bug Compone

[jira] [Created] (FLINK-4218) Sporadic "java.lang.RuntimeException: Error triggering a checkpoint..." causes task restarting

2016-07-14 Thread Sergii Koshel (JIRA)
Sergii Koshel created FLINK-4218: Summary: Sporadic "java.lang.RuntimeException: Error triggering a checkpoint..." causes task restarting Key: FLINK-4218 URL: https://issues.apache.org/jira/browse/FLINK-4218

Re: [Discussion] Query Regarding Operator chaining

2016-07-14 Thread Robert Metzger
Aljoscha is right. Multiple consumers in the same consumer group can not read from the same partition. You'll need to create a Kafka topic with more partitions to have higher parallelism. On Wed, Jul 6, 2016 at 10:45 AM, Aljoscha Krettek wrote: > Hi, > unfortunately the reading of one Kafka part

[jira] [Created] (FLINK-4217) Gelly drivers should read CSV values as strings

2016-07-14 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-4217: - Summary: Gelly drivers should read CSV values as strings Key: FLINK-4217 URL: https://issues.apache.org/jira/browse/FLINK-4217 Project: Flink Issue Type: Improveme

Re: Restructuring Javadoc and Scaladoc for libraries

2016-07-14 Thread Robert Metzger
Hi Chiwan, I think that's something we need to address. Probably the scaladoc plugin is not configured correctly everywhere. On Thu, Jul 14, 2016 at 3:59 AM, Chiwan Park wrote: > Hi all, > > I just noticed some scaladocs (Gelly Scala API, Streaming Scala API, and > FlinkML) are missing in scala

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-07-14 Thread Robert Metzger
I added you to the DataStream API. On Fri, Jun 17, 2016 at 5:36 PM, Kostas Kloudas wrote: > Hello, > > You can also add me to the DataStream API. > > Kostas > > > On Jun 16, 2016, at 7:02 PM, Robert Metzger wrote: > > > > Cool, thank you. > > > > So now we have at least one shepherd for each co

[jira] [Created] (FLINK-4216) WordWithCount example with Java has wrong generics type

2016-07-14 Thread Serhiy Boychenko (JIRA)
Serhiy Boychenko created FLINK-4216: --- Summary: WordWithCount example with Java has wrong generics type Key: FLINK-4216 URL: https://issues.apache.org/jira/browse/FLINK-4216 Project: Flink I

Helping Spread the Word about Apachecon EU 2016

2016-07-14 Thread Sharan Foga
Hi Everyone I'm forwarding the following message on behalf of Rich Bowen and the Apachecon team === As you are aware, we are holding ApacheCon in Seville in November. While this seems like a long way away, it is critical that we get on people's calendar now, so that they can plan, get b

[jira] [Created] (FLINK-4215) timestamp of StreamRecord is lost in WindowOperator

2016-07-14 Thread Cody (JIRA)
Cody created FLINK-4215: --- Summary: timestamp of StreamRecord is lost in WindowOperator Key: FLINK-4215 URL: https://issues.apache.org/jira/browse/FLINK-4215 Project: Flink Issue Type: Bug Com