[ https://issues.apache.org/jira/browse/HIVE-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12969144#action_12969144 ]
He Yongqiang commented on HIVE-1838:
------------------------------------
No. I mean a compression codec for Hive. It could be used to compress
intermediate data.
Here are some results:
5. Hadoop compression with native library (COMPRESSLEVEL=BEST_SPEED)
time java -Djava.library.path=/data/users/heyongqiang/hadoop-0.20/build/native/Linux-amd64-64/lib/ CompressFile
real 0m34.179s
user 0m29.031s
sys 0m1.607s
compressed size: 275M
6. LZF
[heyongqi...@dev782 compress_test]$ time lzf -c 000000_0
real 0m39.031s
user 0m8.727s
sys 0m2.231s
compressed size: 393M
7. FastLZ
time fastlz/6pack -1 000000_0 000000_0.fastlz
real 0m19.020s
user 0m18.083s
sys 0m0.935s
compressed size: 391M
8. QuickLZ
time ./compress_file ../000000_0 ../000000_0.quicklz
real 0m15.652s
user 0m14.047s
sys 0m1.603s
compressed size: 334M
I modified QuickLZ's compress_file code to use a buffer, for fairness. It turns
out the result is very close to FastLZ: the modified version of QuickLZ is just
one second better.
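For anyone who wants to repeat the Java-side measurement (item 5), here is a
rough sketch of a buffered compress-and-time harness. It is not the CompressFile
program used above, which is not shown in this comment. It uses the JDK's
java.util.zip.Deflater at BEST_SPEED instead of Hadoop's native codec, and the
class name BufferedDeflateSketch, the 64 KB buffer size and the command-line
arguments are all assumptions made for illustration.

```java
import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.zip.Deflater;
import java.util.zip.DeflaterOutputStream;

// Hypothetical harness: JDK zlib at BEST_SPEED, not Hadoop's native codec.
class BufferedDeflateSketch {
    // Assumed buffer size; the comment above does not say which size was used.
    static final int BUFFER_SIZE = 64 * 1024;

    // Streams everything from `in` through a BEST_SPEED deflater into `out`.
    // Returns the number of uncompressed bytes read.
    static long compress(InputStream in, OutputStream out) throws IOException {
        Deflater deflater = new Deflater(Deflater.BEST_SPEED);
        long total = 0;
        try {
            DeflaterOutputStream dos =
                new DeflaterOutputStream(out, deflater, BUFFER_SIZE);
            byte[] buf = new byte[BUFFER_SIZE];
            int n;
            while ((n = in.read(buf)) != -1) {
                dos.write(buf, 0, n);
                total += n;
            }
            dos.finish(); // write the remaining compressed data and trailer
            dos.flush();
        } finally {
            deflater.end(); // release the deflater's native memory
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: java BufferedDeflateSketch <input> <output>");
            return;
        }
        long start = System.nanoTime();
        long read;
        try (InputStream in = new BufferedInputStream(
                 new FileInputStream(args[0]), BUFFER_SIZE);
             OutputStream out = new BufferedOutputStream(
                 new FileOutputStream(args[1]), BUFFER_SIZE)) {
            read = compress(in, out);
        }
        double seconds = (System.nanoTime() - start) / 1e9;
        System.out.printf("read %d bytes in %.3f s%n", read, seconds);
    }
}
```

Wrapping both file streams in buffers keeps small reads and writes from
dominating the timing, which is the same reason for adding a buffer to
compress_file. Its timings are not directly comparable with the figures above,
since the codec and the JVM startup cost differ.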
> Add quickLZ compression codec for Hive.
> ---------------------------------------
>
> Key: HIVE-1838
> URL: https://issues.apache.org/jira/browse/HIVE-1838
> Project: Hive
> Issue Type: New Feature
> Reporter: He Yongqiang
>