[ https://issues.apache.org/jira/browse/HADOOP-18938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17811048#comment-17811048 ]
ASF GitHub Bot commented on HADOOP-18938:
-----------------------------------------

steveloughran commented on code in PR #6466:
URL: https://github.com/apache/hadoop/pull/6466#discussion_r1467048011


##########
hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/DefaultS3ClientFactory.java:
##########
@@ -361,6 +366,13 @@ private static URI getS3Endpoint(String endpoint, final Configuration conf) {
    */
   private static Region getS3RegionFromEndpoint(String endpoint) {
 
+    // S3 VPC endpoint parsing
+    Matcher matcher = VPC_ENDPOINT_PATTERN.matcher(endpoint);
+    if(matcher.find()) {
+      LOG.debug("Endpoint {} is vpc endpoint; parsing", endpoint);
+      return Region.of(matcher.group(1));

Review Comment:
   add a debug log saying "mapping to vpce"



##########
hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/DefaultS3ClientFactory.java:
##########
@@ -361,6 +366,13 @@ private static URI getS3Endpoint(String endpoint, final Configuration conf) {
    */
   private static Region getS3RegionFromEndpoint(String endpoint) {
 
+    // S3 VPC endpoint parsing
+    Matcher matcher = VPC_ENDPOINT_PATTERN.matcher(endpoint);
+    if(matcher.find()) {

Review Comment:
   nit, add a space after `if`



##########
hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/DefaultS3ClientFactory.java:
##########
@@ -361,6 +366,13 @@ private static URI getS3Endpoint(String endpoint, final Configuration conf) {
    */
   private static Region getS3RegionFromEndpoint(String endpoint) {
 
+    // S3 VPC endpoint parsing
+    Matcher matcher = VPC_ENDPOINT_PATTERN.matcher(endpoint);
+    if(matcher.find()) {
+      LOG.debug("Endpoint {} is vpc endpoint; parsing", endpoint);

Review Comment:
   so this is going to match on anything with .vpce. isn't it? I think it should include amazonaws.{com,com.cn} at the end so if someone ever sets up an internal host called vpce there's no confusion.

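To illustrate the last review comment, here is a minimal sketch of a VPC endpoint pattern anchored on the `amazonaws.com` / `amazonaws.com.cn` suffix. The class name `VpceRegionSketch`, the constant `ANCHORED_VPCE_PATTERN`, the method `parseVpceRegion`, the regular expression, and the sample hostnames are all assumptions made for this example; they are not the pattern defined in the PR. The method returns the region name as an `Optional<String>` so the sketch runs without the AWS SDK, where the code under review returns `Region.of(matcher.group(1))`.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch: region parsing for VPC endpoints, anchored on the AWS domain suffix. */
final class VpceRegionSketch {

  // Assumed pattern, not the one from the PR: the label immediately before
  // ".vpce.amazonaws.com" or ".vpce.amazonaws.com.cn" is taken as the region.
  private static final Pattern ANCHORED_VPCE_PATTERN = Pattern.compile(
      "^(?:.+\\.)?([a-z0-9-]+)\\.vpce\\.amazonaws\\.(?:com|com\\.cn)$");

  private VpceRegionSketch() {
  }

  /**
   * Extract the region from a VPC endpoint hostname.
   * @param endpoint endpoint hostname
   * @return the region name, or empty if the endpoint is not an AWS VPC endpoint
   */
  static Optional<String> parseVpceRegion(String endpoint) {
    Matcher matcher = ANCHORED_VPCE_PATTERN.matcher(endpoint);
    // matches() requires the whole string to fit the pattern, so a host that
    // merely contains ".vpce." somewhere is rejected.
    if (matcher.matches()) {
      return Optional.of(matcher.group(1));
    }
    return Optional.empty();
  }

  public static void main(String[] args) {
    // Made-up sample hostnames, for illustration only.
    System.out.println(parseVpceRegion(
        "bucket.vpce-0123456789abcdef0-abcdefgh.s3.us-west-2.vpce.amazonaws.com"));
    System.out.println(parseVpceRegion(
        "bucket.vpce-0123456789abcdef0-abcdefgh.s3.cn-north-1.vpce.amazonaws.com.cn"));
    System.out.println(parseVpceRegion("s3.us-west-2.vpce.mycorp.internal"));
  }
}
```

With the suffix anchored, an internal host that happens to contain a `vpce` label yields no region, so the caller can fall through to whatever handling applies to non-standard endpoints.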
> S3A region logic to handle vpce and non standard endpoints
> -----------------------------------------------------------
>
>                 Key: HADOOP-18938
>                 URL: https://issues.apache.org/jira/browse/HADOOP-18938
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 3.4.0
>            Reporter: Ahmar Suhail
>            Priority: Major
>              Labels: pull-request-available
>
> For non standard endpoints such as VPCE the region parsing added in
> HADOOP-18908 doesn't work. This is expected as that logic is only meant to be
> used for standard endpoints.
> If you are using a non-standard endpoint, check if a region is also provided,
> else fail fast.
> Also update documentation to explain region and endpoint behaviour with
> SDK V2.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)