[ https://issues.apache.org/jira/browse/HADOOP-6297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12998681#comment-12998681 ]
Todd Lipcon commented on HADOOP-6297: ------------------------------------- Hrm, I guess what I don't understand is why we haven't needed this method for existing file formats like SequenceFile that do block-based compression? > Hadoop's support for zlib library lacks support to perform flushes > (Z_SYNC_FLUSH and Z_FULL_FLUSH) > -------------------------------------------------------------------------------------------------- > > Key: HADOOP-6297 > URL: https://issues.apache.org/jira/browse/HADOOP-6297 > Project: Hadoop Common > Issue Type: Improvement > Components: io > Reporter: Kevin J. Price > Assignee: Kevin J. Price > Priority: Minor > Attachments: zlibpatch-0.3.patch, zlibpatch.patch > > > The zlib library supports the ability to perform two types of flushes when > deflating data. It can perform both a Z_SYNC_FLUSH, which forces all input to > be written as output and byte-aligned and resets the Huffman coding, and it > also supports a Z_FULL_FLUSH, which does the same thing but additionally > resets the compression dictionary. The Hadoop wrapper for the zlib library > does not support either of these two methods. > Adding support should be fairly trivial. An additional deflate method that > takes a fourth "flush" parameter, and a modification to the native c code to > accept this fourth parameter and pass it along to the zlib library. I can > submit a patch for this if desired. > It should be noted that the native SUN Java API is likewise missing this > functionality, as has been noted for over a decade here: > http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4206909 -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira