Re: Guide for building Flink image with Python doesn't work

2022-07-07 Thread Xingbo Huang
Hi Gyula, According to the log, we can see that you downloaded the source package of pemja, not the wheel package of pemja[1]. I guess you are using the m1 machine. If you install pemja from the source package, you need to have JDK, gcc tools and CPython with Numpy in the environment. I believe th

Configure a kafka source dynamically (???)

2022-07-07 Thread Salva Alcántara
When using the kafka connector, you need to set the topics in advance (by giving a list of them or a regex pattern for the topic names). Imagine a situation where the topics are not known in advance, of course you could use an all-pass regex pattern to match all the topics in the broker but what I

Re: [ANNOUNCE] Apache Flink 1.15.1 released

2022-07-07 Thread Xingbo Huang
Thanks a lot for being our release manager David and everyone who contributed. Best, Xingbo David Anderson 于2022年7月8日周五 06:18写道: > The Apache Flink community is very happy to announce the release of Apache > Flink 1.15.1, which is the first bugfix release for the Apache Flink 1.15 > series. > >

Re: PyFlink: restoring from savepoint

2022-07-07 Thread Dian Fu
Hi John, Could you provide more information, e.g. the exact command submitting the job, the logs file, the PyFlink version, etc? Regards, Dian On Thu, Jul 7, 2022 at 7:53 PM John Tipper wrote: > Hi all, > > I have a PyFlink job running in Kubernetes. Savepoints and checkpoints are > being suc

[ANNOUNCE] Apache Flink 1.15.1 released

2022-07-07 Thread David Anderson
The Apache Flink community is very happy to announce the release of Apache Flink 1.15.1, which is the first bugfix release for the Apache Flink 1.15 series. Apache Flink® is an open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming a

Re: Restoring a job from a savepoint

2022-07-07 Thread John Tipper
Thank you all, that’s very helpful. It looks like there’s something else that’s causing my cluster to not load my savepoints, so I’ve submitted a separate query for that. Many thanks, John Sent from my iPhone On 6 Jul 2022, at 21:24, Alexander Fedulov wrote:  Hi John, use $ bin/flink run

PyFlink: restoring from savepoint

2022-07-07 Thread John Tipper
Hi all, I have a PyFlink job running in Kubernetes. Savepoints and checkpoints are being successfully saved to S3. However, I am unable to get the job to start from a save point. The container is started with these args: “standalone-job”, “-pym”, “foo.main”, “-s”, “s3://”, “-n” In the JM logs

Re: Difference between Session Mode and Session Job(Flink Opearator)

2022-07-07 Thread bat man
Awesome, thanks! On Thu, Jul 7, 2022 at 1:21 PM Gyula Fóra wrote: > Hi! > > The Flink Kubernetes Operator on a high level supports 3 types of > resources: > >1. Session Deployment : Empty Flink Session cluster >2. Application Deployment: Flink Application cluster (single job / >clust

Setting a timer within broadcast applyToKeyedState() (feature request)

2022-07-07 Thread James Sandys-Lumsdaine
Hello, I know we can’t set a timer in the processBroadcastElement() of the KeyedBroadcastProcessFunction as there is no key. However, there is a context.applyToKeyedState() method which allows us to iterate over the keyed state in the scope of a key. So it is possible to add access to the Tim

Re: Difference between Session Mode and Session Job(Flink Opearator)

2022-07-07 Thread Gyula Fóra
Hi! The Flink Kubernetes Operator on a high level supports 3 types of resources: 1. Session Deployment : Empty Flink Session cluster 2. Application Deployment: Flink Application cluster (single job / cluster) 3. Session Job: Flink Job deployed to an existing Session Deployment. So in