[ 
https://issues.apache.org/jira/browse/HADOOP-19072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17842856#comment-17842856
 ] 

ASF GitHub Bot commented on HADOOP-19072:
-----------------------------------------

virajjasani commented on PR #6543:
URL: https://github.com/apache/hadoop/pull/6543#issuecomment-2089769498

   The above proposal of providing a list of optimization flags sounds good.
   
   Please let me know if this summary looks good:
   
   As part of this Jira: 
   
   - Add `fs.s3a.performance.options` as a new config; the only valid values 
for now are `create` and `mkdir`.
   - Create an `S3APerformanceFlags` class (which can contain a List of Enum 
values). The Enum can be `PerformanceFlag`, defined in `StoreContext`.
   - The mapping from the comma-separated String value of 
`fs.s3a.performance.options` to an `S3APerformanceFlags` object can be done by 
a static utility method of the `S3APerformanceFlags` class.
   - Unknown flags are logged once at INFO level.
   - Provide PathCapability for `fs.s3a.performance.options.${flag}` where 
${flag} value would be create/mkdir for now. When this is probed, 
pathCapability should call `S3APerformanceFlags#hasCapability(${flag})`.
   - Document the policy for `fs.s3a.performance.options`: the semantics of 
an existing optimization flag must not change, but new optimization options 
can be added in future to tune this behavior.
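
   To make the summary above concrete, here is a minimal sketch of the parsing 
and capability probe in plain Java. It is an illustration only, not the PR's 
code: the class name `PerformanceFlagsSketch`, the `parse` and 
`hasPathCapability` method names, and the use of `System.out` in place of a 
logger are all assumptions, and the enum is nested here rather than defined in 
`StoreContext` so that the example stays self-contained.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Locale;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical stand-in for the proposed S3APerformanceFlags class. */
final class PerformanceFlagsSketch {

  /** The two flags named in the proposal; more could be added later. */
  enum PerformanceFlag { CREATE, MKDIR }

  /** Assumed capability prefix, taken from the config key in the proposal. */
  static final String CAPABILITY_PREFIX = "fs.s3a.performance.options.";

  /** Unknown flag names already reported, so each is reported only once. */
  private static final Set<String> WARNED = ConcurrentHashMap.newKeySet();

  private final Set<PerformanceFlag> flags;

  private PerformanceFlagsSketch(Set<PerformanceFlag> flags) {
    this.flags = Collections.unmodifiableSet(flags);
  }

  /** Static utility: map the comma-separated config value to a flag set. */
  static PerformanceFlagsSketch parse(String value) {
    EnumSet<PerformanceFlag> parsed = EnumSet.noneOf(PerformanceFlag.class);
    if (value != null) {
      for (String token : value.split(",")) {
        String name = token.trim();
        if (name.isEmpty()) {
          continue; // tolerate stray commas and whitespace
        }
        try {
          parsed.add(PerformanceFlag.valueOf(name.toUpperCase(Locale.ROOT)));
        } catch (IllegalArgumentException e) {
          // A real implementation would log at INFO; printing is a placeholder.
          if (WARNED.add(name)) {
            System.out.println("Ignoring unknown performance flag: " + name);
          }
        }
      }
    }
    return new PerformanceFlagsSketch(parsed);
  }

  /** True if the named flag (for example "create") is enabled. */
  boolean hasCapability(String flagName) {
    if (flagName == null) {
      return false;
    }
    try {
      String key = flagName.trim().toUpperCase(Locale.ROOT);
      return flags.contains(PerformanceFlag.valueOf(key));
    } catch (IllegalArgumentException e) {
      return false; // unknown flag names are never capabilities
    }
  }

  /** Probe using the full capability string, prefix included. */
  boolean hasPathCapability(String capability) {
    return capability != null
        && capability.startsWith(CAPABILITY_PREFIX)
        && hasCapability(capability.substring(CAPABILITY_PREFIX.length()));
  }
}
```

   With this sketch, `PerformanceFlagsSketch.parse("create, mkdir")` enables 
both flags, and a probe for `fs.s3a.performance.options.mkdir` delegates to 
`hasCapability("mkdir")`. Matching flag names case-insensitively is a choice 
made for the example, not something the proposal specifies.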
   
   For future Jiras:
   - Add more optimization options for the `delete` and `rename` operations.
   
   One question: IIUC, we don't need to keep the current PR behavior when 
`fs.s3a.create.performance` is enabled, since we are now introducing the new 
`fs.s3a.performance.options`, correct? Also, would it be prudent to deprecate 
the `fs.s3a.create.performance` config and perhaps log once in s3afs if a user 
is still using it? We could probably do that in a separate Jira too.




> S3A: expand optimisations on stores with "fs.s3a.create.performance"
> --------------------------------------------------------------------
>
>                 Key: HADOOP-19072
>                 URL: https://issues.apache.org/jira/browse/HADOOP-19072
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 3.4.0
>            Reporter: Steve Loughran
>            Assignee: Viraj Jasani
>            Priority: Major
>              Labels: pull-request-available
>
> on an s3a store with fs.s3a.create.performance set, speed up other operations
> *  mkdir to skip parent directory check: just do a HEAD to see if there's a 
> file at the target location



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org
