[
https://issues.apache.org/jira/browse/HBASE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229893#comment-13229893
]
Lars Hofhansl commented on HBASE-4608:
--------------------------------------
bq. I'm not wondering if this patch is worth adding? If compressible stuff is
only shrinking by half, is that big enough win? What do you lot thing? LZMA is
not viable because it takes for ever compressing though its turning SU WALs
into 11-14% original size.
You mean you are *now* wondering? :) IMHO: The WAL is probably the greatest
source of synchronous IO that we generate, cutting this in half seems quite
valuable (maybe this will be less valuable in the future if/when HDFS can do
parallel replication instead of chaining - but it is now).
I agree that none of the block based compression schemes would be good
options... Was merely curious about HLog archiving, which is quite unrelated to
this issue.
+1, let's commit this.
> HLog Compression
> ----------------
>
> Key: HBASE-4608
> URL: https://issues.apache.org/jira/browse/HBASE-4608
> Project: HBase
> Issue Type: New Feature
> Reporter: Li Pi
> Assignee: stack
> Fix For: 0.94.0
>
> Attachments: 4608-v19.txt, 4608-v20.txt, 4608-v22.txt, 4608v1.txt,
> 4608v13.txt, 4608v13.txt, 4608v14.txt, 4608v15.txt, 4608v16.txt, 4608v17.txt,
> 4608v18.txt, 4608v23.txt, 4608v24.txt, 4608v25.txt, 4608v27.txt, 4608v29.txt,
> 4608v30.txt, 4608v5.txt, 4608v6.txt, 4608v7.txt, 4608v8fixed.txt,
> hbase-4608-v28-delta.txt, hbase-4608-v28.txt, hbase-4608-v28.txt
>
>
> The current bottleneck to HBase write speed is replicating the WAL appends
> across different datanodes. We can speed up this process by compressing the
> HLog. Current plan involves using a dictionary to compress table name, region
> id, cf name, and possibly other bits of repeated data. Also, HLog format may
> be changed in other ways to produce a smaller HLog.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira