steveloughran commented on pull request #2069: URL: https://github.com/apache/hadoop/pull/2069#issuecomment-653619415
latest patch wires up stats collection from the workers on an s3a committer job, marshalls them as json in .pending/.pendingset files and then finally aggregates them into the _SUCCESS job summary file. Here's an example of a test run. ```json 2020-07-03 16:47:08,981 [JUnit-ITestMagicCommitProtocol-testOutputFormatIntegration] INFO commit.AbstractCommitITest (AbstractCommitITest.java:loadSuccessFile(503)) - Loading committer success file s3a://stevel-ireland/test/ITestMagicCommitProtocol-testOutputFormatIntegration/_SUCCESS. Actual contents= { "name" : "org.apache.hadoop.fs.s3a.commit.files.SuccessData/1", "timestamp" : 1593791227415, "date" : "Fri Jul 03 16:47:07 BST 2020", "hostname" : "stevel-mbp15-13176.local", "committer" : "magic", "description" : "Task committer attempt_200707120821_0001_m_000000_0", ... "diagnostics" : { "fs.s3a.authoritative.path" : "", "fs.s3a.metadatastore.impl" : "org.apache.hadoop.fs.s3a.s3guard.DynamoDBMetadataStore", "fs.s3a.committer.magic.enabled" : "true", "fs.s3a.metadatastore.authoritative" : "false" }, "filenames" : [ "/test/ITestMagicCommitProtocol-testOutputFormatIntegration/part-m-00000" ], "iostatistics" : { "counters" : { "committer_bytes_committed" : 4, "committer_bytes_uploaded" : 0, "committer_commits_aborted" : 0, "committer_commits_completed" : 1, "committer_commits_created" : 0, "committer_commits_failed" : 0, "committer_commits_reverted" : 0, "committer_jobs_completed" : 1, "committer_jobs_failed" : 0, "committer_tasks_completed" : 1, "committer_tasks_failed" : 0, "stream_write_block_uploads" : 1, "stream_write_block_uploads_data_pending" : 0, "stream_write_bytes" : 4, "stream_write_exceptions" : 0, "stream_write_exceptions_completing_uploads" : 0, "stream_write_queue_duration" : 0, "stream_write_total_data" : 4, "stream_write_total_time" : 0 }, "gauges" : { "stream_write_block_uploads_data_pending" : 4, "stream_write_block_uploads_pending" : 0, }, "minimums" : { }, "maximums" : { }, "meanStatistics" : { } } } ``` I'm in a good mood here. Time for others to look at. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org