Joe McDonnell created IMPALA-12108:
--------------------------------------

             Summary: Add support for writing data with LZ4's high compression 
mode
                 Key: IMPALA-12108
                 URL: https://issues.apache.org/jira/browse/IMPALA-12108
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
    Affects Versions: Impala 4.3.0
            Reporter: Joe McDonnell


LZ4 has a high compression mode that gets higher compression ratios than Snappy 
while maintaining high decompression speeds. The tradeoff is that compression 
is very slow. We should add support for writing data with LZ4 high compression 
mode. This would let us get a sense of the performance for writing and reading.

See this benchmark on the LZ4 page:

https://github.com/lz4/lz4#benchmarks

In my hand tests, Parquet/LZ4 is about 13% smaller than Parquet/Snappy, but it 
retains the fast decompression.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to