[ https://issues.apache.org/jira/browse/MAPREDUCE-4354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13417337#comment-13417337 ]
Robert Joseph Evans commented on MAPREDUCE-4354: ------------------------------------------------ The test results look great to me, but my comment about contributing this to trunk is off base. My ignorance is showing :). The LZO compression libraries that you modified are not hosted here. You need to look at http://code.google.com/a/apache-extras.org/p/hadoop-gpl-compression/?redir=1 or https://github.com/omalley/hadoop-gpl-compression And email the dev list there. Owen O'Mally is probably the right person to talk to there about getting this patch in. Once it is in it should work both on trunk and 0.20.205 > Performance improvement with compressor object reinit restriction > ----------------------------------------------------------------- > > Key: MAPREDUCE-4354 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-4354 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: performance > Affects Versions: 0.20.205.0 > Reporter: Ankit Kamboj > Priority: Minor > Labels: performance > Fix For: 0.20.205.0 > > Attachments: codec_reinit_diff, modify_lzo_codec_reinit > > > HADOOP-5879 patch aimed at picking the conf (instead of default) settings for > GzipCodec. It also involved re-initializing the recycled compressor object. > On our performance tests, this re-initialization led to performance > degradation of 15% for LzoCodec because re-initialization for Lzo involves > reallocation of buffers. LzoCodec takes the initial settings from config so > it is not necessary to re-initialize it. This patch checks for the codec > class and calls reinit only if the codec class is Gzip. This led to > significant performance improvement of 15% for LzoCodec. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira