Re: [DISCUSS] Upgrade HBase connector to 2.2.x

2020-06-19 Thread Ming Li
+1 to support both HBase 2.x and Hbase 1.4.x, just as what we are doing for Kafka. On Fri, Jun 19, 2020 at 4:02 PM Yu Li wrote: > One supplement: > > I noticed that there are discussions in HBase ML this March about removing > stable-1 pointer and got consensus [1], and will follow up in HBase

Re: Trouble with large state

2020-06-19 Thread Vijay Bhaskar
Glad to know some progress. Where are you updating your state here? I couldn't find any flink managed state here. I suggested updating the flink managed state using onTimer over an interval equal to the checkpoint interval. In your case since you do throttling, it helped to maintain the fixed rate

Difference between flink on kubernetes operator vs native kubernetes

2020-06-19 Thread SAMPAD SAHA
I was trying to deploy Flink in Kubernetes environment and came across two things: 1. Kubernetes Flink control plane developed by google and Lyft - https://github.com/lyft/flinkk8soperator - https://github.com/GoogleCloudPlatform/flink-on-k8s-operator 2. Deploying Kubernetes natively.

Re: Trouble with large state

2020-06-19 Thread Jeff Henrikson
Bhaskar, Based on your idea of limiting input to get better checkpoint behavior, I made a ProcessFunction that constraints to a number of events per second per slot per input. I do need to do some stateless input scanning before joins. The stateless part needs to be fast and does no impact

adding s3 object metadata while using StreamFileSink

2020-06-19 Thread dhurandar S
We are creating files in S3 and we want to update the S3 object metadata with some security-related information for governance purposes. Right now Apache Flink totally abstracts how and when S3 object gets created in the system. Is there a way that we can pass the S3 object metadata and update it

Re: Trouble with large state

2020-06-19 Thread Jeff Henrikson
Bhaskar, Thank you for your thoughtful points. > I want to discuss more on points (1) and (2) > If we take care of them rest will be good > > Coming to (1) > > Please try to give reasonable checkpoint interval time for every job. > Minum checkpoint interval recommended by flink community is 3 m

Re: [Announce] Flink Forward Call for Proposals Extended

2020-06-19 Thread Israel Ekpo
Thanks Seth for sharing this. I am looking forward to the event. On Fri, Jun 19, 2020 at 10:54 AM Seth Wiesman wrote: > Hi Everyone! > > > The Call for Presentations for Flink Forward has been extended until *Sunday, > June 28, 11:59 pm PST*. We know that tech conferences are not a priority > f

[Announce] Flink Forward Call for Proposals Extended

2020-06-19 Thread Seth Wiesman
Hi Everyone! The Call for Presentations for Flink Forward has been extended until *Sunday, June 28, 11:59 pm PST*. We know that tech conferences are not a priority for everyone at this moment, so we wanted to ensure everyone has time to work on their ideas. As a reminder, Flink Forward Global C

Re: [DISCUSS] Upgrade HBase connector to 2.2.x

2020-06-19 Thread Yu Li
One supplement: I noticed that there are discussions in HBase ML this March about removing stable-1 pointer and got consensus [1], and will follow up in HBase community about why we didn't take real action. However, this doesn't change my previous statement / stand due to the number of 1.x usages

Re: [DISCUSS] Upgrade HBase connector to 2.2.x

2020-06-19 Thread Yu Li
+1 on upgrading the HBase version of the connector, and 1.4.3 is indeed an old version. OTOH, AFAIK there're still quite some 1.x HBase clusters in production. We could also see that the HBase community is still maintaining 1.x release lines (with "stable-1 release" point to 1.4.13) [1] Please al

Re: [DISCUSS] Drop connectors for 5.x and restart the flink es source connector

2020-06-19 Thread Jark Wu
I'm fine with dropping support for es5. forward to dev@. Best, Jark On Fri, 19 Jun 2020 at 15:46, jackylau wrote: > Hi all: > when i coding the es source connector here > > https://github.com/liuyongvs/flink/commit/c397a759d05956629a27bf850458dd4e70330189 > for the elasticsearch source

[DISCUSS] Drop connectors for 5.x and restart the flink es source connector

2020-06-19 Thread jackylau
Hi all: when i coding the es source connector here https://github.com/liuyongvs/flink/commit/c397a759d05956629a27bf850458dd4e70330189 for the elasticsearch source connector. The doc is here https://cwiki.apache.org/confluence/display/FLINK/FLIP-127%3A+Support+Elasticsearch+Source+Connector ,a

Re: Join tables created from Datastream whose element scala type has field Option[_]

2020-06-19 Thread Jark Wu
Could you try to use blink planner? I guess this works in blink planner. Besides, it is suggested to use String with null values instead of Option[String]. Flink SQL/Table doesn't know Option and will recognize it as a RAW/Generic type which is rather slower. There should be no NPE, otherwise, it