Re: Read parquet data from S3 with Flink 1.12

2021-12-20 Thread Alexandre Montecucco
Hello Piotrek, Thank you for the help. Regarding the S3 issue I have followed the documentation for the plugins. Many of our other apps are using S3 through the Hadoop Fs Flink plugin. Also, in this case, just reading regular plain text file works, I only have an issue when using Parquet. I tried

回复:Re: Will Flink loss some old Keyed State when changing the parallelism

2021-12-20 Thread 杨浩
Thanks for your replay. If we don't set the max parallelism, and we change the parallelism to a very big num, will the state loss? At 2021-11-27 01:20:49, "Yun Tang" wrote: >Hi Yang, > >Flink keeps the max key groups the same no matter how parallelism changes, and >use this to avoi

Re: Flink fails to load class from configured classpath using PipelineOptions

2021-12-20 Thread Yang Wang
Yes. You need to set the "pipeline.classpath" via flink-conf.yaml or CLI options(-C/--classpath). I do not think setting it in your main class could work. Just like you said, the user classloader will not be updated after the user main class is executed. Best, Yang Pouria Pirzadeh 于2021年12月18日周六

Re: unaligned checkpoint for job with large start delay

2021-12-20 Thread Piotr Nowojski
Hi Mason, Those checkpoint timeouts (30 minutes) have you already observed with the alignment timeout set to 0ms? Or as you were previously running it with 1s alignment timeout? If the latter, it might be because unaligned checkpoints are failing to kick in in the first place. Setting the timeout

flink-playground docker/mvn clean install Unknown host repo.maven.apache.org: Name or service not known

2021-12-20 Thread HG
Hello I am trying to the flink-playground examples. The docker-compose build fails on the mvn clean install command . I am behind a proxy. To diagnose this I started a container based on the already created image docker run -it --name my-maven-project -v "$(pwd)":/usr/src/mymaven -w /usr/src/my

Re: Re: Will Flink loss some old Keyed State when changing the parallelism

2021-12-20 Thread Seth Wiesman
No. The default max parallelism of 128 will be applied. If you try to restore above that value, the restore will fail and you can simply restore at a smaller value. No data loss. On Mon, Dec 20, 2021 at 2:28 AM 杨浩 wrote: > > Thanks for your replay. If we don't set the max parallelism, and we ch

Re: CVE-2021-44228 - Log4j2 vulnerability

2021-12-20 Thread Martijn Visser
Hi, The status and Flink ticket for upgrading to Log4j 2.17.0 can be tracked at https://issues.apache.org/jira/browse/FLINK-25375. Best regards, Martijn On Sat, 18 Dec 2021 at 16:50, V N, Suchithra (Nokia - IN/Bangalore) < suchithra@nokia.com> wrote: > Hi, > > > > It seems there is high se

Re: Read parquet data from S3 with Flink 1.12

2021-12-20 Thread Alexandre Montecucco
Hello, I also face the same issue as documented in a previous mail from the mailing list [1] Basically when using flink-parquet, I get: > java.lang.ClassNotFoundException: org.apache.hadoop.conf.Configuration I have no idea what I need to do to fix this and could not find anything from the doc.