Re: [Discussion] flink elasticsearch connector supports

2020-06-05 Thread Etienne Chauchot
Hi Jacky Lau, 1) yes I saw that 2) I saw sources like IntegerSource which are bounded and which extend RichParallelSourceFunction. This is why I mentioned it. 3) True, there is an hadoop ES connector in Beam but it is more of a side connector, the main one is ElasticsearchIO here:

Re: [Discussion] flink elasticsearch connector supports

2020-06-05 Thread Forward Xu
hi Jacky Lau : I agree with jark's point of view, the use of es is not just to read data, more use is to group query, aggregate these. Best, Forward Jacky Lau 于2020年6月5日周五 下午2:47写道: > hi Etienne Chauchot: > you can read here https://www.jianshu.com/p/d32e17dab90c, which is > chinese.But you

Re: [Discussion] flink elasticsearch connector supports

2020-06-05 Thread Jacky Lau
hi Etienne Chauchot: you can read here https://www.jianshu.com/p/d32e17dab90c, which is chinese.But you can konw that slice api has poor performance in es-hadoop project . And i found that es-hadoop has removed this and disable sliced scrolls by default. you can see below, which i found in the

Re: [Discussion] flink elasticsearch connector supports

2020-06-04 Thread Jacky Lau
Thanks for Jacky and Yangze. 1) we will reorganize your design and move it to the flip 2) we will support lookup and DynamicSource. and current DynamicSource's optimize, such as supportLimitPushDown/supportLimitPushdown doesn't have achieved. so we will do it after it have done Thanks for your

Re: [Discussion] flink elasticsearch connector supports

2020-06-04 Thread Jacky Lau
hi Etienne Chauchot: thanks for your discussion. for 1) we do not supprt es unbouded source currently for 2) RichParallelSourceFunction is used for streaming ,InputFormat is for batch for 3) i downloaded beam just now. and the beam es connector is also using es-hadoop. i have read the code of

Re: [Discussion] flink elasticsearch connector supports

2020-06-04 Thread Jark Wu
Thanks Jacky for starting this discussion. The requirement of ES source has been proposed in the community many times. +1 for the feature from my side. Here are my thoughts: 1. streaming source As we only support bounded source for JDBC and HBase, so I think it's fine to have a bounded ES

Re: [Discussion] flink elasticsearch connector supports

2020-06-04 Thread Etienne Chauchot
Hi, I made the Elasticsearch connector of Apache Beam and I was thinking about doing the same for Flink when I came by this discussion. I have some comments regarding the design doc: 1. Streaming source: ES has data streams features but only for time series data; the aim of this source is

Re: [Discussion] flink elasticsearch connector supports

2020-06-04 Thread Yangze Guo
Hi, Jackey. Thanks for driving this discussion. I think this proposal should be a FLIP[1] since it impacts the public interface. However, as we have only some preliminary discussions atm, a design draft would be ok. But it would be better to organize your document according to [2]. I've two

Re: [Discussion] flink elasticsearch connector supports

2020-06-02 Thread Jacky Lau
Hi Robert Metzger: Thanks for your response. could you please read this docs. https://www.yuque.com/jackylau-sc7w6/bve18l/14a2ad5b7f86998433de83dd0f8ec067 . Any Is it any problem here? we are worried about we do not think throughly. thanks. -- Sent from:

Re: [Discussion] flink elasticsearch connector supports

2020-06-02 Thread Robert Metzger
Hi Jacky, thanks a lot for starting the discussion. I have no objections to adding support for reading data from ElasticSearch as well, as long as we clearly state the performance and correctness implications / guarantees in the Flink documentation. On Tue, Jun 2, 2020 at 9:52 AM Jacky Lau

[Discussion] flink elasticsearch connector supports

2020-06-02 Thread Jacky Lau
Hi all! We have started some preliminary work on the flink elasticsearch integration (es connector for es version7) at hikvision research institute. It seems that the integration should think throughly. And we want to contribute our code for the conmunity. So I think I should open a discussion