[ https://issues.apache.org/jira/browse/HADOOP-18752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17728672#comment-17728672 ]
ASF GitHub Bot commented on HADOOP-18752: ----------------------------------------- dannycjones commented on code in PR #5689: URL: https://github.com/apache/hadoop/pull/5689#discussion_r1214190161 ########## hadoop-tools/hadoop-aws/src/site/markdown/tools/hadoop-aws/directory_markers.md: ########## @@ -12,35 +12,40 @@ limitations under the License. See accompanying LICENSE file. --> -# Experimental: Controlling the S3A Directory Marker Behavior +# Controlling the S3A Directory Marker Behavior -This document discusses an experimental feature of the S3A -connector since Hadoop 3.3.1: the ability to retain directory -marker objects above paths containing files or subdirectories. +This document discusses an performance feature of the S3A +connector: directory markers are not deleted unless the +client is explicitly configured to do so. ## <a name="compatibility"></a> Critical: this is not backwards compatible! This document shows how the performance of S3 I/O, especially applications creating many files (for example Apache Hive) or working with versioned S3 buckets can increase performance by changing the S3A directory marker retention policy. -Changing the policy from the default value, `"delete"` _is not backwards compatible_. +The default policy in this release of hadoop is "keep", +which _is not backwards compatible_ with hadoop versions +released before 2021. -Versions of Hadoop which are incompatible with other marker retention policies, -as of August 2020. +The compatibility table of older releases is as follows: -| Branch | Compatible Since | Supported | -|------------|------------------|---------------------| -| Hadoop 2.x | n/a | WONTFIX | -| Hadoop 3.0 | check | Read-only | -| Hadoop 3.1 | check | Read-only | -| Hadoop 3.2 | check | Read-only | -| Hadoop 3.3 | 3.3.1 | Done | +| Branch | Compatible Since | Supported | Released | +|------------|------------------|-----------|----------| +| Hadoop 2.x | 2.10.2 | Read-only | 05/2022 | +| Hadoop 3.0 | n/a | WONTFIX | | +| Hadoop 3.1 | n/a | WONTFIX | | +| Hadoop 3.2 | 3.2.2 | Read-only | 01/2022 | +| Hadoop 3.3 | 3.3.1 | Done | 01/2021 | Review Comment: Thanks for updating this with the extra info. Do we know why the Hadoop webpages aren't formatting the original table? https://hadoop.apache.org/docs/stable/hadoop-aws/tools/hadoop-aws/directory_markers.html#The_Problem_with_Directory_Markers > Change fs.s3a.directory.marker.retention to "keep" > -------------------------------------------------- > > Key: HADOOP-18752 > URL: https://issues.apache.org/jira/browse/HADOOP-18752 > Project: Hadoop Common > Issue Type: Sub-task > Components: fs/s3 > Affects Versions: 3.3.5 > Reporter: Steve Loughran > Assignee: Steve Loughran > Priority: Major > Labels: pull-request-available > > Change the default value of "fs.s3a.directory.marker.retention" to keep; > update docs to match. > maybe include with HADOOP-17802 so we don't blow up with fewer markers being > created. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org