[ https://issues.apache.org/jira/browse/BEAM-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16120095#comment-16120095 ]
Guillaume Balaine commented on BEAM-2500: ----------------------------------------- I got s3a to work on a simple aggregation job, I just write to s3a text files and include "org.apache.hadoop" % "hadoop-aws" % "2.7.3". Is there anything we're missing ? The only trouble I had was in debugging, where my file policy was formatting ':' characters in files which gave a wrong resourceId in beam. > Add support for S3 as a Apache Beam FileSystem > ---------------------------------------------- > > Key: BEAM-2500 > URL: https://issues.apache.org/jira/browse/BEAM-2500 > Project: Beam > Issue Type: Improvement > Components: sdk-java-extensions > Reporter: Luke Cwik > Priority: Minor > > Note that this is for providing direct integration with S3 as an Apache Beam > FileSystem. > There is already support for using the Hadoop S3 connector by depending on > the Hadoop File System module[1], configuring HadoopFileSystemOptions[2] with > a S3 configuration[3]. > 1: https://github.com/apache/beam/tree/master/sdks/java/io/hadoop-file-system > 2: > https://github.com/apache/beam/blob/master/sdks/java/io/hadoop-file-system/src/main/java/org/apache/beam/sdk/io/hdfs/HadoopFileSystemOptions.java#L53 > 3: https://wiki.apache.org/hadoop/AmazonS3 -- This message was sent by Atlassian JIRA (v6.4.14#64029)