Hadoop's support for zlib library lacks support to perform flushes 
(Z_SYNC_FLUSH and Z_FULL_FLUSH)
--------------------------------------------------------------------------------------------------

                 Key: HADOOP-6297
                 URL: https://issues.apache.org/jira/browse/HADOOP-6297
             Project: Hadoop Common
          Issue Type: Improvement
          Components: io
    Affects Versions: 0.21.0
            Reporter: Kevin J. Price
            Priority: Minor


The zlib library supports two types of flush when deflating data. A 
Z_SYNC_FLUSH forces all pending input to be written as output, byte-aligns the 
output, and resets the Huffman coding. A Z_FULL_FLUSH does the same thing but 
additionally resets the compression dictionary, so that decompression can 
restart from that point.  The Hadoop wrapper for the zlib library supports 
neither of these.
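
To make the difference between the two flush modes concrete, here is a small 
sketch using Python's standard zlib binding rather than Hadoop's wrapper. The 
helper name compress_with_flush and the sample data are made up for 
illustration; only the zlib calls themselves are real.

```python
import zlib

def compress_with_flush(chunks, flush_mode):
    """Compress each chunk, flushing after it; return the per-chunk output."""
    comp = zlib.compressobj()
    pieces = []
    for chunk in chunks:
        # compress() may buffer data; flush(mode) forces it all out.
        pieces.append(comp.compress(chunk) + comp.flush(flush_mode))
    # Terminate the stream properly.
    pieces.append(comp.flush(zlib.Z_FINISH))
    return pieces

chunks = [b"hello hadoop " * 20, b"hello hadoop " * 20]
sync_pieces = compress_with_flush(chunks, zlib.Z_SYNC_FLUSH)
full_pieces = compress_with_flush(chunks, zlib.Z_FULL_FLUSH)

# After either flush, everything fed so far can already be decompressed.
first_chunk = zlib.decompressobj().decompress(sync_pieces[0])

# After a full flush, the next piece no longer depends on earlier data,
# so it can be inflated on its own as a raw deflate stream (wbits=-15).
second_chunk = zlib.decompressobj(-15).decompress(full_pieces[1])
```

Both flush modes end their output with an empty stored block, which is what 
byte-aligns the stream. After only a sync flush, the second piece is generally 
not decodable on its own, because it may refer back to data in the first piece.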

Adding support should be fairly straightforward: an additional deflate method 
that takes a fourth "flush" parameter, plus a modification to the native C 
code to accept this parameter and pass it along to the zlib library.  I can 
submit a patch for this if desired.
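
As a rough sketch of the shape of such a method, the following uses Python's 
zlib binding as a stand-in for the native code. The class FlushableDeflater 
and its method names are hypothetical and are not Hadoop's actual compressor 
API; the point is only that the flush mode is an extra argument handed through 
to zlib.

```python
import zlib

class FlushableDeflater:
    """Hypothetical wrapper; illustrates a deflate call with a flush argument."""

    def __init__(self, level=zlib.Z_DEFAULT_COMPRESSION):
        self._comp = zlib.compressobj(level)

    def deflate(self, buf, off, length, flush=zlib.Z_NO_FLUSH):
        """Compress buf[off:off+length], then apply the requested flush mode."""
        out = self._comp.compress(bytes(buf[off:off + length]))
        if flush != zlib.Z_NO_FLUSH:
            # Hand the flush mode through to zlib.
            out += self._comp.flush(flush)
        return out

    def finish(self):
        """Terminate the compressed stream."""
        return self._comp.flush(zlib.Z_FINISH)

deflater = FlushableDeflater()
record = b"record-1\n"
flushed = deflater.deflate(record, 0, len(record), zlib.Z_SYNC_FLUSH)

# Because of the sync flush, a reader can recover the record immediately.
recovered = zlib.decompressobj().decompress(flushed)
```

Defaulting the flush argument to Z_NO_FLUSH keeps the existing three-argument 
call behaving as before, so callers that never flush are unaffected.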

It should be noted that Sun's own Java API is likewise missing this 
functionality, as has been noted for over a decade here: 
http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4206909
