steveloughran commented on pull request #2069:
URL: https://github.com/apache/hadoop/pull/2069#issuecomment-680292512


   ...no, I haven't reviewed anyone elses work today, just been rebasing and 
wiring up the marker and async listing changes.
   
   To put them both together, the "s3guard markers" audit tool, will for a 
verbose run, print the stats from the listing
   
   ```
    bin/hadoop s3guard markers -audit -limit 20000 -verbose s3a://landsat-pds/
   The directory marker policy of s3a://landsat-pds is "Delete"
   2020-08-25 22:57:56,738 [main] INFO  tools.MarkerTool 
(DurationInfo.java:<init>(77)) - Starting: marker scan s3a://landsat-pds/
   Scanned 1,000 objects
   Scanned 2,000 objects
   Scanned 3,000 objects
   Scanned 4,000 objects
   Scanned 5,000 objects
   Scanned 6,000 objects
   Scanned 7,000 objects
   Scanned 8,000 objects
   Scanned 9,000 objects
   Scanned 10,000 objects
   Scanned 11,000 objects
   Scanned 12,000 objects
   Scanned 13,000 objects
   Scanned 14,000 objects
   Scanned 15,000 objects
   Scanned 16,000 objects
   Scanned 17,000 objects
   Scanned 18,000 objects
   Scanned 19,000 objects
   Scanned 20,000 objects
   Limit of scan reached - 20,000 objects
   
   Listing statistics:
     counters=((object_continue_list_request=20) (object_list_request=1)); 
gauges=(); minimums=((object_continue_list_request.min=441) 
(object_list_request.min=1375)); maximums=((object_list_request.max=1375) 
(object_continue_list_request.max=790)); 
means=((object_list_request.mean=(sum=1375, samples=1, mean=1375.0000)) 
(object_continue_list_request.mean=(sum=10145, samples=19, mean=533.9474))); 
   
   2020-08-25 22:58:08,303 [main] INFO  tools.MarkerTool 
(DurationInfo.java:close(98)) - marker scan s3a://landsat-pds/: duration 
0:11.566s
   Listed 20000 objects under s3a://landsat-pds/
   
   No surplus directory markers were found under s3a://landsat-pds/
   Listing limit reached before completing the scan
   
   Storage Statistics for s3a://landsat-pds
   
   op_get_file_status   1
   object_list_request  2
   object_continue_list_request 20
   
   2020-08-25 22:58:08,312 [main] INFO  util.ExitUtil 
(ExitUtil.java:terminate(210)) - Exiting with status 3: 
   ```
   
   Even for this operation we are saving a couple of seconds: the sum of all 
the list operations are 13s, but the wall time is only 11 seconds. That's two 
seconds of my life saved :)


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to