Re: Getting Beam(Python)-on-Flink-on-k8s to work

2020-08-26 Thread Kyle Weaver
> - With the Flink operator, I was able to submit a Beam job, but hit the issue that I need Docker installed on my Flink nodes. I haven't yet tried changing the operator's yaml files to add Docker inside them. Running Beam workers via Docker on the Flink nodes is not recommended (and probably not

Getting Beam(Python)-on-Flink-on-k8s to work

2020-08-26 Thread Eugene Kirpichov
Hi folks, I'm still working with Pachama right now; we have a Kubernetes Engine cluster on GCP and want to run Beam Python batch pipelines with custom containers against it. Flink and Cloud Dataflow are the two options; Cloud Dataflow doesn't support custom containers for ba

Re: Out-of-orderness of window results when testing stateful operators with TextIO

2020-08-26 Thread Luke Cwik
Splitting is part of the issue. Other example issues are: * "sources" that input data into the pipeline have no requirement to produce records in a time ordered manner. * timers can hold the output watermark and produce records out of order with time. All of this time ordering has a cost to perfo

RE: Info needed - pmc mailing list

2020-08-26 Thread D, Anup (Nokia - IN/Bangalore)
Thank you -Original Message- From: Robert Bradshaw Sent: Tuesday, August 25, 2020 10:06 PM To: user Cc: Jash, Shaswata (Nokia - IN/Bangalore) ; Prabhala, Anuradha (Nokia - IN/Bangalore) ; Kulkarni, Sujay (Nokia - IN/Bangalore) Subject: Re: Info needed - pmc mailing list Try priv...@