[ https://issues.apache.org/jira/browse/HADOOP-14407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014519#comment-16014519 ]
Yongjun Zhang commented on HADOOP-14407: ---------------------------------------- Thanks [~steve_l]. {quote} .that's something whoever commits it needs to do {quote} I personally think it's better for developer to clean it up so jenkins test reports clean result. Especially for new contributors, once they are aware of this, they can easily do so before submitting the patch. It'd be nice that the last patch rev attached the jira is what get committed without change. Thanks for the updated patch [~omkarksa], are you guys still looking into setting input and output buffer to different size? Or any chance we need to do that in the future? Somehow your submitting the patch did not trigger a jenkins test, maybe there is an infra issue. > DistCp - Introduce a configurable copy buffer size > -------------------------------------------------- > > Key: HADOOP-14407 > URL: https://issues.apache.org/jira/browse/HADOOP-14407 > Project: Hadoop Common > Issue Type: Improvement > Components: tools/distcp > Affects Versions: 2.9.0 > Reporter: Omkar Aradhya K S > Assignee: Omkar Aradhya K S > Fix For: 2.9.0, 3.0.0-alpha3 > > Attachments: HADOOP-14407.001.patch, HADOOP-14407.002.patch, > TotalTime-vs-CopyBufferSize.jpg > > > Currently, the RetriableFileCopyCommand has a fixed copy buffer size of just > 8KB. We have noticed in our performance tests that with bigger buffer sizes > we saw upto ~3x performance boost. Hence, making the copy buffer size a > configurable setting via the new parameter <copybuffersize>. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org