steveloughran commented on pull request #2069:
URL: https://github.com/apache/hadoop/pull/2069#issuecomment-653619415


   latest patch wires up stats collection from the workers on an s3a committer 
job, marshalls them as json in .pending/.pendingset files and then finally 
aggregates them into the _SUCCESS job summary file. Here's an example of a test 
run.
   
   ```json
   2020-07-03 16:47:08,981 
[JUnit-ITestMagicCommitProtocol-testOutputFormatIntegration] INFO  
commit.AbstractCommitITest (AbstractCommitITest.java:loadSuccessFile(503)) - 
Loading committer success file 
s3a://stevel-ireland/test/ITestMagicCommitProtocol-testOutputFormatIntegration/_SUCCESS.
 Actual contents=
   {
     "name" : "org.apache.hadoop.fs.s3a.commit.files.SuccessData/1",
     "timestamp" : 1593791227415,
     "date" : "Fri Jul 03 16:47:07 BST 2020",
     "hostname" : "stevel-mbp15-13176.local",
     "committer" : "magic",
     "description" : "Task committer attempt_200707120821_0001_m_000000_0",
   ...
     "diagnostics" : {
       "fs.s3a.authoritative.path" : "",
       "fs.s3a.metadatastore.impl" : 
"org.apache.hadoop.fs.s3a.s3guard.DynamoDBMetadataStore",
       "fs.s3a.committer.magic.enabled" : "true",
       "fs.s3a.metadatastore.authoritative" : "false"
     },
     "filenames" : [ 
"/test/ITestMagicCommitProtocol-testOutputFormatIntegration/part-m-00000" ],
     "iostatistics" : {
       "counters" : {
         "committer_bytes_committed" : 4,
         "committer_bytes_uploaded" : 0,
         "committer_commits_aborted" : 0,
         "committer_commits_completed" : 1,
         "committer_commits_created" : 0,
         "committer_commits_failed" : 0,
         "committer_commits_reverted" : 0,
         "committer_jobs_completed" : 1,
         "committer_jobs_failed" : 0,
         "committer_tasks_completed" : 1,
         "committer_tasks_failed" : 0,
         "stream_write_block_uploads" : 1,
         "stream_write_block_uploads_data_pending" : 0,
         "stream_write_bytes" : 4,
         "stream_write_exceptions" : 0,
         "stream_write_exceptions_completing_uploads" : 0,
         "stream_write_queue_duration" : 0,
         "stream_write_total_data" : 4,
         "stream_write_total_time" : 0
       },
       "gauges" : {
         "stream_write_block_uploads_data_pending" : 4,
         "stream_write_block_uploads_pending" : 0,
       },
       "minimums" : { },
       "maximums" : { },
       "meanStatistics" : { }
     }
   }
   
   ```
   
   I'm in a good mood here. Time for others to look at.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to