Gentle thread bump.
On Thu, Jan 18, 2018 at 4:03 PM, Pradeep Gollakota
wrote:
> Hi All,
>
> Can one of you review my PR at https://github.com/apache/
> parquet-mr/pull/447 please?
>
> Thanks,
> Pradeep
>
You can use the parquet-cli or parquet-tools to inspect metadata, including
compression codecs.
Dictionary pages are compressed with the same compression codec that the
other pages use.
On Tue, Jan 30, 2018 at 10:19 AM, Hao Luo wrote:
> Thanks Ryan.
>
> For the dictionary
Thanks Ryan.
For the dictionary page, is it compressed using LZO? How do I find out what
codec is used when compressing the dictionary page?
Hao
On Tue, Jan 30, 2018 at 8:28 AM, Ryan Blue
wrote:
> Parquet doesn't use LZOP. The supported formats are the ones you see
notes:
Julien (Wework)
Gidon (IBM): secure analytics. JIRA + Draft
Ryan (Netflix): Parquet-787 needs review
Lars (Cloudera, Impala): Discuss Zoltan’s proposal. Feature sets
Jim (Cloudera, Impala): Bloom filters
Zoltan (Cloudera): Java 8 transition, breaking changes management
Gabor (Cloudera):
happening now: meet.google.com/nhj-cvpt-atx
Parquet doesn't use LZOP. The supported formats are the ones you see in
CompressionCodecName, which is used to identify the compression codec.
You can see the file metadata, including compression, using the parquet-cli
module's meta command.
rb
On Mon, Jan 29, 2018 at 6:50 PM, Hao Luo