RE: AvroRowDeserializationSchema

2022-04-24 Thread lan tran
Yeah, I already tried that way. However, if we did not use DataStream at first. We cannot implement the Savepoint since through the doc if we use TableAPI (SQL API), the uid is generated automatically which means we cannot revert if the system is crashed. Best,Quynh Sent from Mail for Windows 

Re: AvroRowDeserializationSchema

2022-04-24 Thread Dian Fu
DebeziumAvroRowDeserializationSchema and DebeziumJsonRowDeserializationSchema are still not supported in Python DataStream API. Just take a further look at the Java implementation of DebeziumAvroDeserializationSchema and DebeziumJsonDeserializationSchema, the results type is RowData instead of

Re: AvroRowDeserializationSchema

2022-04-24 Thread Dian Fu
Yes, we should support them. For now, if you want to use them, you could create ones in your own project. You could refer to AvroRowDeserializationSchema[1] as an example. It should not be complicated as it's simply a wrapper of the Java implementation. Regards, Dian [1]

Re: Unit testing PyFlink SQL project

2022-04-24 Thread Dian Fu
Regarding the problem `python setup.py install` vs `pip install apache-flink==1.14.4`, have created an issue https://issues.apache.org/jira/browse/FLINK-27373 to track it. On Mon, Apr 25, 2022 at 9:42 AM Dian Fu wrote: > Hi John, > > I'm also using MacOS. This is the steps I'm following which I

OOM errors cause by the new KafkaSink API

2022-04-24 Thread Hua Wei Chen
Hi all, Due to FlinkKafkaConsumer and FlinkKafkaProducer will be depreciated at Flink 1.15*[1]*, we are trying to migrate the APIs to KafkaSource and KafkaSink*[2]*. At the same time, we also modified the serilizers*[3]*. Our Kafka settings are not changed*[4]*. The services are very stable

RE: AvroRowDeserializationSchema

2022-04-24 Thread lan tran
Thank Dian !! Very appreciate this.However, I have another questions related to this. In current version or any updating in future, does DataStream support DebeziumAvroRowDeserializationSchema and DebeziumJsonRowDeserializationSchema in PyFlink ? Since I look at the documentation and seem it is

how to setup working dir in Flink operator

2022-04-24 Thread 陳昌倬
Hi, We try to migrate our application from `Flink on standalone Kubernetes` to `Application mode on Flink operator`. However, we cannot configure to use local SSD for RocksDB state successful. Any through? Detail: In original `Flink on standalone Kubernetes`: - set `io.tmp.dirs` to local SSD

TaskMgr Metaspace become bigger and bigger after submitting new jobs

2022-04-24 Thread Lei Wang
I start a standalone session on a single server with only one taskMgr. The JVM metaspace will become bigger after submitting a new job. Even if I cancel the submitted job, the JVM metaspace will not decrease. After submitting about 15 times, the task manager was shut down because of OOM

Random incorrect checkpoint existence check

2022-04-24 Thread Chen-Che Huang
Hi all, We recently encountered a random issue. When our Flink application is doing checkpoint creation, it occasionally fails because it thinks the medatafile of the checkpoint already exists. However, the medata file does not exist actually. We use Flink version 1.14.4 and the checkpoints are

Re: Unit testing PyFlink SQL project

2022-04-24 Thread Dian Fu
Hi John, I'm also using MacOS. This is the steps I'm following which I have run successfully: 1) python3 -m venv .venv 2) source .venv/bin/activate 3) pip install apache-flink==1.14.4 4) python -c "import pyflink;import os;print(os.path.dirname(os.path.abspath(pyflink.__file__))+'/log')" It will

Re: Unit testing PyFlink SQL project

2022-04-24 Thread John Tipper
And now when I add further dependencies to the classpath to remove all ClassNotFound exceptions, I get a different error which I don't understand (TypeError: Could not found the Java class 'EnvironmentSettings.inStreamingMode'.), see the logs below: $ python test_table_api.py

Re: Unit testing PyFlink SQL project

2022-04-24 Thread John Tipper
I get a bit further when I add all of the transitive dependencies to the classpath, where I download these by calling mvn twice: mkdir -p out mvn org.apache.maven.plugins:maven-dependency-plugin:2.10:copy -Dartifact=org.apache.flink:flink-python_2.11:1.14.4:pom -DoutputDirectory=$(pwd)/out

Re: Unit testing PyFlink SQL project

2022-04-24 Thread John Tipper
Hi Dian, Thank you very much, that's very helpful. I'm seeing a couple of errors when I try to run the example though (Python 3.8 on Mac OS). 1. I create a fresh Python virtual env: `python -m venv .venv` 2. `source .venv/bin/activate` 3. When I tried to configure the project by

flink ????

2022-04-24 Thread ????????
flink/run jar??

FlinkSQL ????k8s??????????

2022-04-24 Thread ??????
??KyuubiFlinkSQL??k8s(Application mode)??(Session mode)Application??jar??jobgraph??SQL?? CREATE TABLE T ( id INT ) WITH ( 'connector.type' = 'filesystem', 'connector.path' = 'file:///tmp/tmp.csv', 'format.type'

FlinkSQL

2022-04-24 Thread ??????
??KyuubiFlinkSQL??k8s(Application mode)??(Session mode)Application??jar??jobgraph??SQL?? CREATE TABLE T ( id INT ) WITH ( 'connector.type' = 'filesystem', 'connector.path' = 'file:///tmp/tmp.csv', 'format.type'