[ https://issues.apache.org/jira/browse/SPARK-21702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16139898#comment-16139898 ]
Steve Loughran commented on SPARK-21702: ---------------------------------------- IF this is just "directories", then there are no directories in s3. We create some mock ones for empty dirs (i.e after a mkdirs() call), through 0-byte objects. We then delete all such 0-byte objects when you write data underneath {{see S3AFilesystem.deleteUnnecessaryFakeDirectories(Path)}}. I think that's what's been causing the confusion. I'm going to close this one as invalid. sorry. FWIW, if you do want to guarantee data in a bucket is encrypted, set the bucket policy to mandate this. It's the best way to be confident that all your data is locked down: [[https://hortonworks.github.io/hdp-aws/s3-encryption/index.html]] > Structured Streaming S3A SSE Encryption Not Visible through AWS S3 GUI when > PartitionBy Used > -------------------------------------------------------------------------------------------- > > Key: SPARK-21702 > URL: https://issues.apache.org/jira/browse/SPARK-21702 > Project: Spark > Issue Type: Bug > Components: Structured Streaming > Affects Versions: 2.2.0 > Environment: Hadoop 2.7.3: AWS SDK 1.7.4 > Hadoop 2.8.1: AWS SDK 1.10.6 > Reporter: George Pongracz > Priority: Minor > Labels: security > > Settings: > .config("spark.hadoop.fs.s3a.impl", > "org.apache.hadoop.fs.s3a.S3AFileSystem") > .config("spark.hadoop.fs.s3a.server-side-encryption-algorithm", > "AES256") > When writing to an S3 sink from structured streaming the files are being > encrypted using AES-256 > When introducing a "PartitionBy" the output data files are unencrypted. > All other supporting files, metadata are encrypted > Suspect write to temp is encrypted and move/rename is not applying the SSE. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org