Re: 答复: [DISCUSS] Development of SQL OVER / Table API Row Windows for streaming tables

2017-02-06 Thread Fabian Hueske
Hi Jinkui Shi, Yuhong Hong, Sunjincheng, I'd suggest to discuss this on the actual JIRA issue. I think it would help to describe the design and status of the implementation. Thanks, Fabian 2017-02-06 3:24 GMT+01:00 shijinkui : > hi,Fabian, sunjincheng > > Today is the first workday of 2017 in C

[jira] [Created] (FLINK-5714) Use a builder pattern for creating CsvTableSource

2017-02-06 Thread Timo Walther (JIRA)
Timo Walther created FLINK-5714: --- Summary: Use a builder pattern for creating CsvTableSource Key: FLINK-5714 URL: https://issues.apache.org/jira/browse/FLINK-5714 Project: Flink Issue Type: Imp

[jira] [Created] (FLINK-5715) Asynchronous snapshotting for HeapKeyedStateBackend

2017-02-06 Thread Stefan Richter (JIRA)
Stefan Richter created FLINK-5715: - Summary: Asynchronous snapshotting for HeapKeyedStateBackend Key: FLINK-5715 URL: https://issues.apache.org/jira/browse/FLINK-5715 Project: Flink Issue Typ

[jira] [Created] (FLINK-5716) Make streaming SourceContexts aware of source idleness

2017-02-06 Thread Tzu-Li (Gordon) Tai (JIRA)
Tzu-Li (Gordon) Tai created FLINK-5716: -- Summary: Make streaming SourceContexts aware of source idleness Key: FLINK-5716 URL: https://issues.apache.org/jira/browse/FLINK-5716 Project: Flink

[jira] [Created] (FLINK-5717) NPE on SessionWindows with ContinuousProcessingTimeTrigger

2017-02-06 Thread Oriol Orellana (JIRA)
Oriol Orellana created FLINK-5717: - Summary: NPE on SessionWindows with ContinuousProcessingTimeTrigger Key: FLINK-5717 URL: https://issues.apache.org/jira/browse/FLINK-5717 Project: Flink Is

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Katherin Eri
Hello, guys. Theodore, last week I started the review of the PR: https://github.com/apache/flink/pull/2735 related to *word2Vec for Flink*. During this review I have asked myself: why do we need to implement such a very popular algorithm like *word2vec one more time*, when there is already availab

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Katherin Eri
Sorry, guys I need to finish this letter first. Full version of it will come shortly. пн, 6 февр. 2017 г. в 12:49, Katherin Eri : > Hello, guys. > Theodore, last week I started the review of the PR: > https://github.com/apache/flink/pull/2735 related to *word2Vec for Flink*. > > During this rev

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Katherin Eri
Hello, guys. Theodore, last week I started the review of the PR: https://github.com/apache/flink/pull/2735 related to *word2Vec for Flink*. During this review I have asked myself: why do we need to implement such a very popular algorithm like *word2vec one more time*, when there is already avai

[jira] [Created] (FLINK-5718) Handle JVM Fatal Exceptions in Tasks

2017-02-06 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-5718: --- Summary: Handle JVM Fatal Exceptions in Tasks Key: FLINK-5718 URL: https://issues.apache.org/jira/browse/FLINK-5718 Project: Flink Issue Type: Improvement

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Theodore Vasiloudis
Hell all, This is point that has come up in the past: Given the multitude of ML libraries out there, should we have native implementations in FlinkML or try to integrate other libraries instead? We haven't managed to reach a consensus on this before. My opinion is that there is definitely value i

[jira] [Created] (FLINK-5719) Let LatencyMarkers completely bypass operators / chains

2017-02-06 Thread Tzu-Li (Gordon) Tai (JIRA)
Tzu-Li (Gordon) Tai created FLINK-5719: -- Summary: Let LatencyMarkers completely bypass operators / chains Key: FLINK-5719 URL: https://issues.apache.org/jira/browse/FLINK-5719 Project: Flink

[jira] [Created] (FLINK-5720) Deprecate "Folding" in all of DataStream API

2017-02-06 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-5720: --- Summary: Deprecate "Folding" in all of DataStream API Key: FLINK-5720 URL: https://issues.apache.org/jira/browse/FLINK-5720 Project: Flink Issue Type:

[jira] [Created] (FLINK-5721) Add FoldingState to State Documentation

2017-02-06 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-5721: --- Summary: Add FoldingState to State Documentation Key: FLINK-5721 URL: https://issues.apache.org/jira/browse/FLINK-5721 Project: Flink Issue Type: Impro

[jira] [Created] (FLINK-5722) Implement DISTINCT as dedicated operator

2017-02-06 Thread Fabian Hueske (JIRA)
Fabian Hueske created FLINK-5722: Summary: Implement DISTINCT as dedicated operator Key: FLINK-5722 URL: https://issues.apache.org/jira/browse/FLINK-5722 Project: Flink Issue Type: Improvemen

RE: [jira] [Created] (FLINK-5722) Implement DISTINCT as dedicated operator

2017-02-06 Thread Radu Tudoran
Hi, Should we discuss also about the design of distinct for the stream case? It might go well in the context of tables as well as in the context of aggregates over windows... Dr. Radu Tudoran Senior Research Engineer - Big Data Expert IT R&D Division HUAWEI TECHNOLOGIES Duesseldorf GmbH Europe

RE: [jira] [Created] (FLINK-5656) Add processing time OVER ROWS BETWEEN UNBOUNDED PRECEDING aggregation to SQL

2017-02-06 Thread Stefano Bortoli
Hi Fabian, After working around the rule, I am moving towards the implementation of the Aggregation function. I started working extending DataStreamRel (for which I created a Java version). However, I noticed the LogicalWindowAggregate provides the list of aggregatedCalls and other parameters

Re: [DISCUSS] Development of SQL OVER / Table API Row Windows for streaming tables

2017-02-06 Thread Shaoxuan Wang
Sorry for the late response. Hi Jark, Thanks for raising a good question - my proposal “may not work for non-incremental aggregation (e.g. max, min, and median)”, but I have some different opinions. Yes, I have proposed a concept of “accumulate on getValue” in my UDAGG proposal https://goo.gl/6nt

[jira] [Created] (FLINK-5723) Use "Used" instead of "Initial" to make taskmanager tag more readable

2017-02-06 Thread Tao Wang (JIRA)
Tao Wang created FLINK-5723: --- Summary: Use "Used" instead of "Initial" to make taskmanager tag more readable Key: FLINK-5723 URL: https://issues.apache.org/jira/browse/FLINK-5723 Project: Flink Is

Re: [DISCUSS] Development of SQL OVER / Table API Row Windows for streaming tables

2017-02-06 Thread Fabian Hueske
Hi Shaoxuan, I think you are right. The UDAGG interface allows to collect all input values in the accumulation buffer, so values can also be removed from there in case of an retraction. This makes all functions retractable at the cost of materializing a potentially large accumulation buffer, but t

Re: [jira] [Created] (FLINK-5656) Add processing time OVER ROWS BETWEEN UNBOUNDED PRECEDING aggregation to SQL

2017-02-06 Thread Fabian Hueske
Hi Stefano, I don't think we should integrate this with LogicalWindowAggregate which is meant for GroupBy windows and not Over windows. Moreover, LogicalWindowAggregate is on the logical plan level but we need to implement a physical operator, i.e., a DataStreamRel. Calcite parses the SQL query in

Re: [RESULT][VOTE] Release Apache Flink 1.2.0 (RC3)

2017-02-06 Thread Robert Metzger
The release is out: http://flink.apache.org/news/2017/02/06/release-1.2.0.html Also, there's a tweet: https://twitter.com/ApacheFlink/status/828633701332422658 On Fri, Feb 3, 2017 at 11:47 PM, Fabian Hueske wrote: > Thanks Robert for managing the release and everybody who contributed! > > Cheers

[Discuss] Organizing Documentation for Configuration Options

2017-02-06 Thread Greg Hogan
Hi devs, Flink's Configuration page [1] has grown intimidatingly long and complex. Options are described across three main sections: common options (single section), advanced options (multiple sections), and full reference. The trailing "background" section further describes the most impactful opt

[DISCUSS] Planning Release 1.3

2017-02-06 Thread Robert Metzger
Hi, according to our recent time-based releases discussion, I came up with the following deadlines for the upcoming 1.3 release: *Feature freeze (branch forking)*: 1 May 2017 *Code freeze (first voting RC)*: 15 May 2017 *Release date*: 26 May 2017 I will try to post into this thread monthly to

[jira] [Created] (FLINK-5724) Error in the 'Zipping Elements' docs

2017-02-06 Thread Fokko Driesprong (JIRA)
Fokko Driesprong created FLINK-5724: --- Summary: Error in the 'Zipping Elements' docs Key: FLINK-5724 URL: https://issues.apache.org/jira/browse/FLINK-5724 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-5725) Support JOIN between two streams in the SQL API

2017-02-06 Thread Haohui Mai (JIRA)
Haohui Mai created FLINK-5725: - Summary: Support JOIN between two streams in the SQL API Key: FLINK-5725 URL: https://issues.apache.org/jira/browse/FLINK-5725 Project: Flink Issue Type: New Featu

[jira] [Created] (FLINK-5726) Add the RocketMQ plugin for the Apache Spark

2017-02-06 Thread Longda Feng (JIRA)
Longda Feng created FLINK-5726: -- Summary: Add the RocketMQ plugin for the Apache Spark Key: FLINK-5726 URL: https://issues.apache.org/jira/browse/FLINK-5726 Project: Flink Issue Type: Task

[jira] [Created] (FLINK-5727) Unify some API of batch and stream TableEnvironment

2017-02-06 Thread Kurt Young (JIRA)
Kurt Young created FLINK-5727: - Summary: Unify some API of batch and stream TableEnvironment Key: FLINK-5727 URL: https://issues.apache.org/jira/browse/FLINK-5727 Project: Flink Issue Type: Impro

[jira] [Created] (FLINK-5728) FlinkKafkaProducer should flush on checkpoint by default

2017-02-06 Thread Tzu-Li (Gordon) Tai (JIRA)
Tzu-Li (Gordon) Tai created FLINK-5728: -- Summary: FlinkKafkaProducer should flush on checkpoint by default Key: FLINK-5728 URL: https://issues.apache.org/jira/browse/FLINK-5728 Project: Flink