[ https://issues.apache.org/jira/browse/FLINK-6428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16012541#comment-16012541 ]
Fabian Hueske commented on FLINK-6428:
--------------------------------------

{{SELECT DISTINCT}} is already supported. There is no need to add a dedicated operator or translation path for it. During logical optimization, {{SELECT DISTINCT a, b, c FROM t}} is rewritten to {{SELECT a, b, c FROM t GROUP BY a, b, c}} and translated as such.

IMO, we can close this issue. What do you think [~sunjincheng121]?

> Add support DISTINCT in dataStream SQL
> --------------------------------------
>
> Key: FLINK-6428
> URL: https://issues.apache.org/jira/browse/FLINK-6428
> Project: Flink
> Issue Type: New Feature
> Components: Table API & SQL
> Reporter: sunjincheng
> Assignee: sunjincheng
>
> Add support for DISTINCT in DataStream SQL as follows:
> DATA:
> {code}
> (name, age)
> (kevin, 28),
> (sunny, 6),
> (jack, 6)
> {code}
> SQL:
> {code}
> SELECT DISTINCT age FROM MyTable
> {code}
> RESULTS:
> {code}
> 28, 6
> {code}
> To DataStream:
> {code}
> inputDS
>   .keyBy()   // KeyBy on all fields
>   .flatMap() // Eliminate duplicate data
> {code}
> [~fhueske] do we need this feature?

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
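
The equivalence the comment relies on (a {{SELECT DISTINCT}} over some columns returns the same rows as a {{GROUP BY}} over those columns) can be checked outside Flink. The sketch below is only an illustration under stated assumptions: it uses Python's built-in sqlite3 module rather than Flink's planner, the helper name run is made up for this example, and the table holds the three sample rows from the issue description. It does not show how Flink performs the rewrite internally.

```python
import sqlite3

# Sample rows from the issue description: (name, age).
ROWS = [("kevin", 28), ("sunny", 6), ("jack", 6)]


def run(query):
    """Run a query on an in-memory copy of MyTable and return the ages, sorted.

    Hypothetical helper for this illustration; SQLite stands in for Flink here.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE MyTable (name TEXT, age INTEGER)")
        conn.executemany("INSERT INTO MyTable VALUES (?, ?)", ROWS)
        # Sort in Python because SQL does not guarantee row order without ORDER BY.
        return sorted(row[0] for row in conn.execute(query))
    finally:
        conn.close()


# The query from the issue and the GROUP BY form described in the comment.
distinct_ages = run("SELECT DISTINCT age FROM MyTable")
grouped_ages = run("SELECT age FROM MyTable GROUP BY age")

print(distinct_ages, grouped_ages)
```

Both queries return the same two ages for the sample data, which matches the RESULTS block in the issue apart from ordering.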