[ https://issues.apache.org/jira/browse/HADOOP-18117?focusedWorklogId=726256&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-726256 ]
ASF GitHub Bot logged work on HADOOP-18117:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 14/Feb/22 12:12
            Start Date: 14/Feb/22 12:12
    Worklog Time Spent: 10m
      Work Description: mohan3d commented on a change in pull request #3970:
URL: https://github.com/apache/hadoop/pull/3970#discussion_r805781663

##########
File path: hadoop-tools/hadoop-distcp/src/site/markdown/DistCp.md.vm
##########
@@ -363,6 +363,7 @@ Command Line Options
 | `-xtrack <path>` | Save information about missing source files to the specified path. | This option is only valid with `-update` option. This is an experimental property and it cannot be used with `-atomic` option. |
 | `-direct` | Write directly to destination paths | Useful for avoiding potentially very expensive temporary file rename operations when the destination is an object store |
 | `-useiterator` | Uses single threaded listStatusIterator to build listing | Useful for saving memory at the client side. Using this option will ignore the numListstatusThreads option |
+| `-updateRootDirectoryAttributes` | Update root directory attributes (eg permissions, ownership ...) | Useful if you need to enforce root directory attributes update when using distcp |

Review comment:
@ferhui should I re-work the internal code so everything is `updateRoot`
instead of `updateRootDirectoryAttributes`, or only the end-user `distcp` tool?

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

    Worklog Id:     (was: 726256)
    Time Spent: 1h 40m  (was: 1.5h)

> Add an option to preserve root directory permissions
> ----------------------------------------------------
>
>                 Key: HADOOP-18117
>                 URL: https://issues.apache.org/jira/browse/HADOOP-18117
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Mohanad Elsafty
>            Assignee: Mohanad Elsafty
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> As mentioned in https://issues.apache.org/jira/browse/HADOOP-15211
>
> If *-update* or *-overwrite* is passed to *distcp*, the root directory
> is skipped in two places (CopyListing#doBuildListing and
> CopyCommitter#preserveFileAttributesForDirectories), so the root
> directory's attributes are ignored.
>
> We face the same issue when we distcp huge amounts of data between
> clusters, and it takes too much effort to update the root directories'
> attributes manually.
>
> From the earlier ticket it's obvious why this behaviour is there, but
> sometimes we need to enforce the root directory update. Hence I will add
> a new option for distcp so that someone (who understands the need for
> this and knows what they are doing) can enforce the update of the root
> directory's attributes (permissions, ownership, ...).
>
> It should be a simple one, something like this:
> {code:java}
> $ hadoop distcp -p -update -updateRootDirectoryAttributes /a/b/c /a/b/d
> {code}
> This behaviour is optional and will be *false* by default (it should
> not affect existing *distcp* users).

--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
For additional commands, e-mail: common-issues-h...@hadoop.apache.org
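
The behaviour described in the issue text (skip the root directory when -update or -overwrite is set, unless a new opt-in flag forces it) can be sketched in plain Java. This is only a simplified illustration under stated assumptions, not the DistCp source: the class `RootAttributeSketch`, the method `dirsToPreserve`, and the `updateRoot` parameter are hypothetical names, and paths are modelled as plain strings rather than Hadoop `Path` objects.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch of the root-skip decision; not the actual DistCp code. */
final class RootAttributeSketch {

    /**
     * Returns the directories whose attributes would be preserved.
     * The target root is skipped when update or overwrite is set,
     * unless the proposed opt-in flag (modelled as updateRoot) is enabled.
     */
    static List<String> dirsToPreserve(String targetRoot, List<String> dirs,
                                       boolean update, boolean overwrite,
                                       boolean updateRoot) {
        boolean skipRoot = (update || overwrite) && !updateRoot;
        List<String> result = new ArrayList<>();
        for (String dir : dirs) {
            if (skipRoot && dir.equals(targetRoot)) {
                continue; // existing behaviour described in the issue
            }
            result.add(dir);
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> dirs = Arrays.asList("/a/b/d", "/a/b/d/sub");
        // -update only: the root /a/b/d is left out of the result
        System.out.println(dirsToPreserve("/a/b/d", dirs, true, false, false));
        // -update plus the proposed flag: the root is included
        System.out.println(dirsToPreserve("/a/b/d", dirs, true, false, true));
    }
}
```

Defaulting `updateRoot` to false keeps the current behaviour for existing users, which matches the stated intent that the option be opt-in.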