[ 
https://issues.apache.org/jira/browse/HBASE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229893#comment-13229893
 ] 

Lars Hofhansl commented on HBASE-4608:
--------------------------------------

bq. I'm not wondering if this patch is worth adding? If compressible stuff is 
only shrinking by half, is that big enough win? What do you lot thing? LZMA is 
not viable because it takes for ever compressing though its turning SU WALs 
into 11-14% original size.

You mean you are *now* wondering? :) IMHO: The WAL is probably the greatest 
source of synchronous IO that we generate, cutting this in half seems quite 
valuable (maybe this will be less valuable in the future if/when HDFS can do 
parallel replication instead of chaining - but it is now).
I agree that none of the block based compression schemes would be good 
options... Was merely curious about HLog archiving, which is quite unrelated to 
this issue.

+1, let's commit this.
                
> HLog Compression
> ----------------
>
>                 Key: HBASE-4608
>                 URL: https://issues.apache.org/jira/browse/HBASE-4608
>             Project: HBase
>          Issue Type: New Feature
>            Reporter: Li Pi
>            Assignee: stack
>             Fix For: 0.94.0
>
>         Attachments: 4608-v19.txt, 4608-v20.txt, 4608-v22.txt, 4608v1.txt, 
> 4608v13.txt, 4608v13.txt, 4608v14.txt, 4608v15.txt, 4608v16.txt, 4608v17.txt, 
> 4608v18.txt, 4608v23.txt, 4608v24.txt, 4608v25.txt, 4608v27.txt, 4608v29.txt, 
> 4608v30.txt, 4608v5.txt, 4608v6.txt, 4608v7.txt, 4608v8fixed.txt, 
> hbase-4608-v28-delta.txt, hbase-4608-v28.txt, hbase-4608-v28.txt
>
>
> The current bottleneck to HBase write speed is replicating the WAL appends 
> across different datanodes. We can speed up this process by compressing the 
> HLog. Current plan involves using a dictionary to compress table name, region 
> id, cf name, and possibly other bits of repeated data. Also, HLog format may 
> be changed in other ways to produce a smaller HLog.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to