[ https://issues.apache.org/jira/browse/HADOOP-8462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13568385#comment-13568385 ]
Govind Kamat commented on HADOOP-8462: -------------------------------------- The native-code codec shows a performance improvement of 10 to 15% over the pure-Java version using GridMix. With Intel's IPP libraries, the improvement is in the range of 50% or more. > Native-code implementation of bzip2 codec > ----------------------------------------- > > Key: HADOOP-8462 > URL: https://issues.apache.org/jira/browse/HADOOP-8462 > Project: Hadoop Common > Issue Type: Improvement > Components: io > Affects Versions: 0.23.1 > Reporter: Govind Kamat > Attachments: HADOOP-8462-2.0.2a.patch, HADOOP-8462-trunk.patch > > Original Estimate: 672h > Remaining Estimate: 672h > > The bzip2 codec supplied with Hadoop is currently available only as a Java > implementation. A version that uses the system bzip2 library can provide > improved performance and a better memory footprint. This will also make it > feasible to utilize alternative bzip2 libraries that may perform better for > specific jobs. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira