Chinmay Sumant created FLINK-27333:
--------------------------------------
Summary: Upgrade flink-filesystems hadoop version to version 3.3.2
Key: FLINK-27333
URL: https://issues.apache.org/jira/browse/FLINK-27333
Project: Flink
Issue Type: Improvement
Components: FileSystems
Affects Versions: 1.14.3
Reporter: Chinmay Sumant
We have a security requirement to encrypt Flink state on the client side for
certain Flink applications that process sensitive data.
Currently, no feature supports this out of the box with the AWS S3
backend.
We found that one way to do it is to use flink-s3-fs-hadoop compiled against
Hadoop 3.3.2 for checkpoints, because Hadoop 3.3.2 provides out-of-the-box AWS
client-side encryption, using AWS KMS keys to encrypt the data before it is
written to S3.
We were able to change the flink-filesystems shaded Hadoop version from the
existing 3.2.2 to 3.3.2 and compile with minimal code changes.
We used the resulting flink-s3-fs-hadoop jar in the plugin path of our Flink
jobs, and it worked well for checkpoints/savepoints of up to 250 GB each with
client-side encryption using AWS KMS.
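For reference, a minimal sketch of the kind of configuration this setup relies
on. It assumes the S3A option names documented for Hadoop 3.3.2
(fs.s3a.encryption.algorithm and fs.s3a.encryption.key) and that
flink-s3-fs-hadoop forwards fs.s3a.-prefixed keys from flink-conf.yaml to the
shaded Hadoop configuration. The bucket name and KMS key ARN are placeholders,
not values from our deployment.

```yaml
# flink-conf.yaml (sketch; bucket and key ARN below are placeholders)

# Write checkpoints through the S3 filesystem plugin.
state.checkpoints.dir: s3://<your-bucket>/checkpoints

# Assumed Hadoop 3.3.2 S3A options: encrypt on the client with a KMS key
# before the data is uploaded to S3.
fs.s3a.encryption.algorithm: CSE-KMS
fs.s3a.encryption.key: arn:aws:kms:<region>:<account-id>:key/<key-id>
```

The rebuilt jar still has to sit in its own subdirectory under the Flink
plugins directory for these options to take effect.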
We are filing this Jira to request that these changes be taken upstream, and
to check whether changing the Hadoop version raises concerns for any other
components, since our observations have been limited to the plugin jar and
checkpoints using the flink-s3-fs-hadoop filesystem.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)