[ 
https://issues.apache.org/jira/browse/HADOOP-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12854316#action_12854316
 ] 

Xiao Kang commented on HADOOP-6683:
-----------------------------------

A comparison test was performed on a 1.8GB web log file. The results are as 
follows:

|| read file buffer size (bytes) || write to compress stream buffer size (bytes) || old time (secs) || new time (secs) || decrease % ||
| 65536 | 100 | 67 | 49 | 26.8% |
| 65536 | 200 | 56.5 | 46.5 | 17.7% |
| 65536 | 400 | 51.5 | 45 | 12.6% |
| 65536 | 800 | 48.5 | 44.5 | 8.2% |
| 65536 | 1024 | 46.8 | 44.2 | 9.8% |
| 65536 | 4096 | 45 | 43.5 | 3.3% |
| 65536 | 65536 | 44.6 | 43.2 | 3.1% |
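
For anyone who wants to run a similar comparison, here is a minimal sketch of the kind of loop measured above. It is not the actual test program: it uses the JDK's java.util.zip.DeflaterOutputStream as a stand-in for Hadoop's ZlibCompressor, compresses a small synthetic log held in memory instead of the 1.8GB file, and the class and method names (WriteSizeBenchmark, compress, decreasePercent) are hypothetical. The timings it prints will not match the table.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DeflaterOutputStream;

public class WriteSizeBenchmark {

    /** Percentage decrease from oldSecs to newSecs, as in the last table column. */
    static double decreasePercent(double oldSecs, double newSecs) {
        return (oldSecs - newSecs) / oldSecs * 100.0;
    }

    /**
     * Reads the input with a readSize-byte buffer and passes the data to the
     * compression stream in pieces of at most writeSize bytes.
     * Returns the number of compressed bytes produced.
     */
    static long compress(InputStream in, int readSize, int writeSize) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        byte[] buf = new byte[readSize];
        // Assumption: the JDK deflate stream stands in for Hadoop's compressor.
        try (OutputStream out = new DeflaterOutputStream(sink)) {
            int n;
            while ((n = in.read(buf)) != -1) {
                for (int off = 0; off < n; off += writeSize) {
                    out.write(buf, off, Math.min(writeSize, n - off));
                }
            }
        }
        return sink.size();
    }

    public static void main(String[] args) throws IOException {
        // Synthetic stand-in for a web log: one made-up line repeated many times.
        byte[] line = "127.0.0.1 - - [10/Apr/2010:12:00:00 +0000] \"GET /index.html HTTP/1.1\" 200 1024\n"
                .getBytes(StandardCharsets.US_ASCII);
        byte[] data = new byte[line.length * 100_000];
        for (int i = 0; i < data.length; i += line.length) {
            System.arraycopy(line, 0, data, i, line.length);
        }

        for (int writeSize : new int[] {100, 200, 400, 800, 1024, 4096, 65536}) {
            long start = System.nanoTime();
            long compressed = compress(new ByteArrayInputStream(data), 65536, writeSize);
            double secs = (System.nanoTime() - start) / 1e9;
            System.out.printf("write size %6d: %.3f s, %d compressed bytes%n",
                    writeSize, secs, compressed);
        }
    }
}
```

Running the same loop against the old and the patched compressor and feeding the two times to decreasePercent gives the "decrease %" column. A single pass like this is noisy, so repeated runs and a warm-up pass would be needed before trusting small differences.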


Is there any standard benchmark for compression suitable for this case?

> the first optimization: ZlibCompressor does not fully utilize the buffer
> ------------------------------------------------------------------------
>
>                 Key: HADOOP-6683
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6683
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: io
>    Affects Versions: 0.20.2
>            Reporter: Xiao Kang
>         Attachments: ZlibCompressor.java.patch
>
>
> Thanks to Hong Tang for the advice.
> Sub-task created for the first optimization. HADOOP-6662 closed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
