Re: Dynamically repartitioned sources

2016-03-11 Thread Robert Metzger
Hi Maxim, you can implement a source for the system you are describing without changing the parallelism of Flink. What you have to do is implement your own data sources for Flink. I would start by implementing the ParallelSourceFunction interface, where each parallel source instance is reading fro

Re: Query regarding using storm topology in Flink

2016-03-11 Thread Robert Metzger
I agree with Victor. Such errors are often caused by scala version mixups (2.10 vs 2.11). On Wed, Mar 9, 2016 at 2:42 AM, Vitor Vieira wrote: > Hi Samiksha > > Could you post here what dependencies are you using. It looks like you > could be using outdated dependencies, or making unnecessary ref

XGBoost on DataFlow and Flink

2016-03-11 Thread Tianqi Chen
Hi Flink Developers I am sending this email to let you know about XGBoost4J, a package that we are planning to announce next week . Here is the draft version of the post https://github.com/dmlc/xgboost/blob/master/doc/jvm/xgboost4j-intro.md In short, XGBoost is a machine learning package t

Re: zipWithIndex in Python API

2016-03-11 Thread Chesnay Schepler
The subtaskIndex is not currently exposed to the python operator. Fortunately this can be changed very easily: On the java side, within PythonStreamer.startPython() the python process is started and several parameters are transferred (L.129++) using stdin/-out. These parameters are received on

zipWithIndex in Python API

2016-03-11 Thread Shannon Quinn
Hi all, I'm interested in getting involved the Python API development. The first use-case I've encountered in my work is that of zipWithIndex, so I started looking into how to go about implementing that. It looks like the core of it involves being able to uniquely identify what worker you're

Re: Flink-1.0.0 JobManager is not running in Docker Container on AWS

2016-03-11 Thread Stephan Ewen
Hi Deepak! We can currently not split the bind address and advertised address, because the Akka library only accepts packages sent explicitly to the bind address (not sure why Akka has this artificial limitation, but it is there). Can you bridge the container IP address to be visible from the out

Re: Flink-1.0.0 JobManager is not running in Docker Container on AWS

2016-03-11 Thread Ufuk Celebi
Hey Deepak! Your description of Flink's behaviour is correct. To summarize: # Host Address If you specify a host address as an argument to the JVM (via jobmanager.sh or the start-cluster.sh scripts) then that one is used. If you don't, it falls back to the value configured in flink-conf.yaml (wh

[jira] [Created] (FLINK-3609) Revisit selection of Calcite rules

2016-03-11 Thread Fabian Hueske (JIRA)
Fabian Hueske created FLINK-3609: Summary: Revisit selection of Calcite rules Key: FLINK-3609 URL: https://issues.apache.org/jira/browse/FLINK-3609 Project: Flink Issue Type: Task C

[jira] [Created] (FLINK-3608) ImmutableSettings error in ElasticsearchSink

2016-03-11 Thread Carlos Gonzalez de Zarate (JIRA)
Carlos Gonzalez de Zarate created FLINK-3608: Summary: ImmutableSettings error in ElasticsearchSink Key: FLINK-3608 URL: https://issues.apache.org/jira/browse/FLINK-3608 Project: Flink

[jira] [Created] (FLINK-3607) Decrease default forkCount for tests

2016-03-11 Thread Maximilian Michels (JIRA)
Maximilian Michels created FLINK-3607: - Summary: Decrease default forkCount for tests Key: FLINK-3607 URL: https://issues.apache.org/jira/browse/FLINK-3607 Project: Flink Issue Type: Impr

[jira] [Created] (FLINK-3606) Expose Configuration from RichFunctions

2016-03-11 Thread JIRA
Márton Balassi created FLINK-3606: - Summary: Expose Configuration from RichFunctions Key: FLINK-3606 URL: https://issues.apache.org/jira/browse/FLINK-3606 Project: Flink Issue Type: Improveme

Re: Update the link in Job.java for Flink examples

2016-03-11 Thread Maximilian Michels
Hi Janardhan, I just fixed that on the master and the release-1.0 branch because you mentioned this on the user mailing list. Thanks, Max On Fri, Mar 11, 2016 at 10:32 AM, Aljoscha Krettek wrote: > Hi, > could you please open a Jira issue for that. > > Cheers, > Aljoscha >> On 11 Mar 2016, at 0

Re: Update the link in Job.java for Flink examples

2016-03-11 Thread Aljoscha Krettek
Hi, could you please open a Jira issue for that. Cheers, Aljoscha > On 11 Mar 2016, at 06:41, janardhan shetty wrote: > > We need to update the Job.java file of 1.0 to hold the correct links of > flink examples: > > Currently it is pointing to > http://flink.apache.org/docs/latest/examples.html

Re: a question about operatorChain.broadcastCheckpointBarrier

2016-03-11 Thread Aljoscha Krettek
Yes, you are right. In other tasks the lock is not strictly necessary but we have one method in the base class that covers all tasks. > On 11 Mar 2016, at 05:04, Ma GuoWei wrote: > > it is very helpful。 thanks a lot > > now i know the souce task needs the lock。 > > Other tasks like the Stream