Re: could you please add me Contributor list?

2015-11-25 Thread Robert Metzger
I added you as a contributor. On Wed, Nov 25, 2015 at 7:29 AM, jun aoki wrote: > Hi Henry, thank you for helping. My id is jaoki > > On Tue, Nov 24, 2015 at 9:47 PM, Henry Saputra > wrote: > > > Hi Jun, > > > > What is your JIRA username? > > > > But for now, you can always work on JIRA issue w

Re: Union a data stream with a product of itself

2015-11-25 Thread Gyula Fóra
Yes, I am not sure if this the intentional behaviour. I think you are supposed to be able to do the things you described. stream.union(stream.map(..)) and things like this are fair operations. Also maybe stream.union(stream) should just give stream instead of an error. Could someone comment on th

Re: Null Pointer Exception in tests but only in COLLECTION mode

2015-11-25 Thread Maximilian Michels
Hi Martin, Great. Thanks for the fix! Cheers, Max On Tue, Nov 24, 2015 at 7:40 PM, Martin Junghanns wrote: > Hi Max, > > fixed in https://github.com/apache/flink/pull/1396 > > Best, > Martin > > > On 24.11.2015 13:46, Maximilian Michels wrote: >> >> Hi André, hi Martin, >> >> This looks very mu

Re: Union a data stream with a product of itself

2015-11-25 Thread Stephan Ewen
"stream.union(stream.map(..))" should definitely be possible. Not sure why this is not permitted. "stream.union(stream)" would contain each element twice, so should either give an error or actually union (or duplicate) elements... Stephan On Wed, Nov 25, 2015 at 10:42 AM, Gyula Fóra wrote: >

RE: The null in Flink

2015-11-25 Thread Li, Chengxiang
Hi In this mail list, there are some discussions about null value handling in Flink, and I saw several related JIRAs as well(like FLINK-2203, FLINK-2210), but unfortunately, got reverted due to immature design, and no further action since then. I would like to pick this topic up here, as it's qu

[jira] [Created] (FLINK-3077) Add "version" command to CliFrontend for showing the version of the installation

2015-11-25 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-3077: - Summary: Add "version" command to CliFrontend for showing the version of the installation Key: FLINK-3077 URL: https://issues.apache.org/jira/browse/FLINK-3077 Proj

Re: The null in Flink

2015-11-25 Thread Timo Walther
Hi Chengxiang, I totally agree that the Table API should fully support NULL values. The Table API is a logical API and therefore we should be as close to ANSI SQL as possible. Rows need to be nullable in the near future. 2. i, ii, iii and iv sound reasonable. But v, vi and vii sound to much

Re: Union a data stream with a product of itself

2015-11-25 Thread Bruecke, Christoph
Hi, the operation “stream.union(stream.map(id))” is equivalent to “stream.union(stream)” isn’t it? So it might also duplicate the data. - Christoph > On 25 Nov 2015, at 11:24, Stephan Ewen wrote: > > "stream.union(stream.map(..))" should definitely be possible. Not sure why > this is not per

Re: Union a data stream with a product of itself

2015-11-25 Thread Gyula Fóra
Well it kind of depends on what definition of union are we using. If this is a union in a set theoretical way we can argue that the union of a stream with itself should be the same stream because it contains exactly the same elements with the same timestamps and lineage. On the other hand stream a

[jira] [Created] (FLINK-3078) JobManager does not shutdown when checkpointed jobs are running

2015-11-25 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-3078: - Summary: JobManager does not shutdown when checkpointed jobs are running Key: FLINK-3078 URL: https://issues.apache.org/jira/browse/FLINK-3078 Project: Flink

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Robert Metzger
+1 - Build a maven project with the staging repository - started Flink on YARN on a CDH 5.4.5 / Hadoop 2.6.0-cdh5.4.5 cluster with YARN and HDFS HA - ran some kafka (0.8.2.0) read / write experiments - job cancellation with yarn is working ;) I found the following issue while testing: https://iss

Re: Union a data stream with a product of itself

2015-11-25 Thread Vasiliki Kalavri
So, do we all agree that the current behavior is not correct? Shall I open a JIRA about this? On 25 November 2015 at 13:58, Gyula Fóra wrote: > Well it kind of depends on what definition of union are we using. If this > is a union in a set theoretical way we can argue that the union of a stream

Re: Union a data stream with a product of itself

2015-11-25 Thread Gyula Fóra
Yes, please Vasiliki Kalavri ezt írta (időpont: 2015. nov. 25., Sze, 14:37): > So, do we all agree that the current behavior is not correct? Shall I open > a JIRA about this? > > On 25 November 2015 at 13:58, Gyula Fóra wrote: > > > Well it kind of depends on what definition of union are we usi

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Stephan Ewen
+1 - License and Notice are good - ran all tests (including manual tests) work for hadoop 2.3.0 - Scala 2.10 - ran all tests for hadoop 2.7.0 - Scala 2.11 - ran all examples, several on larger external data - checked web frontend - checked quickstart archetypes On Tue, Nov 24, 2015 at 1:54

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Till Rohrmann
Checked checksums for src release and Hadoop 2.7 Scala 2.10 release Checked binaries in source release - contains ./flink-staging/flink-avro/src/test/resources/testdata.avro License - no new files added which are relevant for licensing Build Flink and run tests from source release for Hadoop 2.5

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Stephan Ewen
@Till I think the avro test data file is okay, the "no binaries" policy refers to binary executables, as far as I know. On Wed, Nov 25, 2015 at 2:54 PM, Till Rohrmann wrote: > Checked checksums for src release and Hadoop 2.7 Scala 2.10 release > > Checked binaries in source release > - contains

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Till Rohrmann
Alright, then I withdraw my remark concerning testdata.avro. On Wed, Nov 25, 2015 at 2:56 PM, Stephan Ewen wrote: > @Till I think the avro test data file is okay, the "no binaries" policy > refers to binary executables, as far as I know. > > On Wed, Nov 25, 2015 at 2:54 PM, Till Rohrmann > wrot

Re: Apache Tinkerpop & Geode Integration?

2015-11-25 Thread James Thornton
Hi Vasia - Yes, a FlinkGraphComputer should be a straight-forward first step. Also, on the Apache Tinkerpop dev mailing list, Marko thought it might be cool if there was a "Graph API" similar to the "Table API" -- hooking in Gremlin to Flink's fluent API would give Flink users a full graph query l

[jira] [Created] (FLINK-3079) Add utility for measuring the raw read throughput from a Kafka topic

2015-11-25 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-3079: - Summary: Add utility for measuring the raw read throughput from a Kafka topic Key: FLINK-3079 URL: https://issues.apache.org/jira/browse/FLINK-3079 Project: Flink

[jira] [Created] (FLINK-3080) Cannot union a data stream with a product of itself

2015-11-25 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-3080: Summary: Cannot union a data stream with a product of itself Key: FLINK-3080 URL: https://issues.apache.org/jira/browse/FLINK-3080 Project: Flink Issue Type:

Re: Union a data stream with a product of itself

2015-11-25 Thread Vasiliki Kalavri
Here's the issue: https://issues.apache.org/jira/browse/FLINK-3080 -V. On 25 November 2015 at 14:38, Gyula Fóra wrote: > Yes, please > > Vasiliki Kalavri ezt írta (időpont: 2015. nov. > 25., Sze, 14:37): > > > So, do we all agree that the current behavior is not correct? Shall I > open > > a J

Re: Apache Tinkerpop & Geode Integration?

2015-11-25 Thread Vasiliki Kalavri
Hi James, I've just subscribed to the Tinkerpop dev mailing list. Could you please send a reply to the thread, so then I can reply to it? I'm not sure how I can reply to the thread otherwise... I also saw that there is a grafos.ml project thread. I could also provide some input there :) Thanks! -

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Aljoscha Krettek
+1 I ran an example with a custom operator that processes high-volume kafka input/output and has a large state size. I ran this on 10 GCE nodes. > On 25 Nov 2015, at 14:58, Till Rohrmann wrote: > > Alright, then I withdraw my remark concerning testdata.avro. > > On Wed, Nov 25, 2015 at 2:56 P

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Ufuk Celebi
+1 - Verified hashes and signatures - Ran example jobs on YARN with vanilla Hadoop vesions (on 4 GCE nodes): * 2.7.1 with Flink Hadoop 2.7 binary, Scala 2.10 and 11 * 2.6.2 with Flink Hadoop 2.6 binary, Scala 2.10 * 2.4.1 with Flink Hadoop 2.4 binary, Scala 2.10 * 2.3.0 with Flink Hadoop 2

[ANNOUNCE] CFP open for ApacheCon North America 2016

2015-11-25 Thread Rich Bowen
Community growth starts by talking with those interested in your project. ApacheCon North America is coming, are you? We are delighted to announce that the Call For Presentations (CFP) is now open for ApacheCon North America. You can submit your proposed sessions at http://events.linuxfoundation.o

Re: [VOTE] Release Apache Flink 0.10.1 (release-0.10.0-rc1)

2015-11-25 Thread Henry Saputra
+1 LICENSE file looks good in source artifact NOTICE file looks good in source artifact Signature file looks good in source artifact Hash files looks good in source artifact No 3rd party executables in source artifact Source compiled All tests are passed Run standalone mode test app - Henry On M

RE: The null in Flink

2015-11-25 Thread Li, Chengxiang
Thanks, Timo. We may put the NULL related function support to SQL API, but for Scalar expression and Boolean expression, it already been supported in Table API, without NULL value handling support, query with Scalar expression and Boolean expression would fail while encounter NULL value. Thank