[jira] [Commented] (FLINK-14807) Add Table#collect api for fetching data to client

Jiangjie Qin (Jira) Thu, 27 Feb 2020 01:17:28 -0800


    [ 
https://issues.apache.org/jira/browse/FLINK-14807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17046376#comment-17046376
 ]


Jiangjie Qin commented on FLINK-14807:
--------------------------------------

{quote}Your first point is actually the root of the complexity. According to 
the current analysis, the JM should be responsible for retrieving the results, 
which would make things more complex and less straightforward, e.g. because 
there is no communication channel between the tasks and the JM.
{quote}
A communication channel is always needed, whether delivery is exactly-once or 
not. Exactly-once semantics (EOS) require an additional protocol on top of the 
basic communication channel. When I said it would be simpler, I meant that the 
protocol would be simpler.

One straightforward implementation would be to let the JM act as a proxy that 
simply forwards requests from the client to the sinks in a round-robin manner, 
with lazy acks. In this case EOS is much easier, given that we can assume the 
client is always up and fetching.
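
To make the proxy idea concrete, here is a minimal in-memory sketch. It is 
only an illustration under stated assumptions, not Flink code: the class names 
(CollectProxySketch, SinkBuffer, JobManagerProxy) are hypothetical, and "lazy 
ack" is interpreted here as "the ack for a batch is piggybacked on the client's 
next request and handed to the sink the next time the round robin reaches it". 
There is no real RPC; method calls stand in for the communication channel.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

public class CollectProxySketch {

    /** Hypothetical sink-side buffer: holds a served batch until it is acked. */
    static class SinkBuffer {
        private final Deque<String> pending = new ArrayDeque<>();
        private List<String> inFlight = new ArrayList<>();

        void add(String row) {
            pending.add(row);
        }

        /** Serves a new batch if the previous one was acked, else re-serves it. */
        List<String> fetch(boolean ackPrevious, int maxRows) {
            if (ackPrevious || inFlight.isEmpty()) {
                inFlight = new ArrayList<>();
                while (inFlight.size() < maxRows && !pending.isEmpty()) {
                    inFlight.add(pending.poll());
                }
            }
            return new ArrayList<>(inFlight);
        }
    }

    /** Hypothetical JM-side proxy: forwards client requests round robin. */
    static class JobManagerProxy {
        private final List<SinkBuffer> sinks;
        private final boolean[] acked; // acks waiting to be delivered to each sink
        private int next = 0;
        private int lastServed = -1;

        JobManagerProxy(List<SinkBuffer> sinks) {
            this.sinks = sinks;
            this.acked = new boolean[sinks.size()];
        }

        /** ackPrevious: the client received the batch returned by the last call. */
        List<String> fetch(boolean ackPrevious, int maxRows) {
            if (lastServed >= 0 && ackPrevious) {
                acked[lastServed] = true; // remembered, delivered lazily
            }
            int sink = next;
            next = (next + 1) % sinks.size();
            List<String> batch = sinks.get(sink).fetch(acked[sink], maxRows);
            acked[sink] = false;
            lastServed = sink;
            return batch;
        }
    }

    public static void main(String[] args) {
        SinkBuffer s0 = new SinkBuffer();
        SinkBuffer s1 = new SinkBuffer();
        s0.add("a");
        s0.add("b");
        s1.add("c");
        JobManagerProxy proxy = new JobManagerProxy(Arrays.asList(s0, s1));

        System.out.println(proxy.fetch(false, 1)); // first request, nothing to ack
        System.out.println(proxy.fetch(true, 1));  // acks the batch from sink 0
        System.out.println(proxy.fetch(false, 1)); // client lost the last batch
        System.out.println(proxy.fetch(true, 1));  // sink 1 re-serves its batch
    }
}
```

In this sketch a sink only discards a batch once the ack reaches it, so a lost 
response is served again on the next visit. That is enough only because the 
client is assumed to stay up and to report acks truthfully; client failover 
would need more state than is shown here.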

> Add Table#collect api for fetching data to client
> -------------------------------------------------
>
>                 Key: FLINK-14807
>                 URL: https://issues.apache.org/jira/browse/FLINK-14807
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table SQL / API
>    Affects Versions: 1.9.1
>            Reporter: Jeff Zhang
>            Priority: Major
>              Labels: usability
>             Fix For: 1.11.0
>
>         Attachments: table-collect.png
>
>
> Currently, it is very inconvenient for users to fetch the data of a Flink job: 
> they have to specify a sink explicitly and then fetch the data from that sink 
> via its API (e.g. write to an HDFS sink, then read the data from HDFS). 
> However, most of the time users just want to get the data and do whatever 
> processing they want. So Flink needs to provide a Table#collect API for this 
> purpose. 
>  
> Other APIs such as Table#head and Table#print would also be helpful.  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
