anujmodi2021 opened a new pull request, #6718:
URL: https://github.com/apache/hadoop/pull/6718

   Description of PR
   Jira Ticket: https://issues.apache.org/jira/browse/HADOOP-18656
   
   Today, when a recursive delete is issued for a large directory in ADLS Gen2 
(HNS) account, the directory deletion happens in O(1) but in backend ACL Checks 
are done recursively for each object inside that directory which in case of 
large directory could lead to request time out. Pagination is introduced in the 
Azure Storage Backend for these ACL checks.
   
   More information on how pagination works can be found on public 
documentation of [Azure Delete Path 
API](https://learn.microsoft.com/en-us/rest/api/storageservices/datalakestoragegen2/path/delete?view=rest-storageservices-datalakestoragegen2-2019-12-12).
   
   This PR contains changes to support this from client side. To trigger 
pagination, client needs to add a new query parameter "paginated" and set it to 
true along with recursive set to true. In return if the directory is large, 
server might return a continuation token back to the caller. If caller gets 
back a continuation token, it has to call the delete API again with 
continuation token along with recursive and pagination set to true. This is 
similar to directory delete of FNS account.
   
   Pagination is available only in versions "2023-08-03" onwards.
   PR also contains functional tests to verify driver works well with different 
combinations of recursive and pagination features for both HNS and FNS account.
   Full E2E testing of pagination requires large dataset to be created and 
hence not added as part of driver test suite. But extensive E2E testing has 
been performed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to