[
https://issues.apache.org/jira/browse/IMPALA-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Armstrong updated IMPALA-3073:
----------------------------------
Issue Type: New Feature (was: Bug)
> Verify if compressed data in avro, sequence file format could be multistream
> ----------------------------------------------------------------------------
>
> Key: IMPALA-3073
> URL: https://issues.apache.org/jira/browse/IMPALA-3073
> Project: IMPALA
> Issue Type: New Feature
> Components: Backend
> Affects Versions: Impala 2.0
> Reporter: Juan Yu
> Priority: Major
>
> When generating compressed text files, certain tools (such as pbzip2) can
> compress in parallel and produce multistream compressed data. We need to
> verify whether this also applies to other file formats, such as Parquet,
> Avro, and sequence files, that use those codecs. If so,
> Codec::ProcessBlock() should support multistream compressed block data:
> decompression should not stop when it reaches the end of a compressed
> stream (e.g. Z_STREAM_END, BZ_STREAM_END) if more input remains.
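
To illustrate the behavior described above, here is a minimal sketch in Python using the standard bz2 module. It is not Impala's Codec::ProcessBlock() and says nothing about how Impala implements its codecs. The function name decompress_multistream is hypothetical. The idea is that when one stream ends (the Python analogue of BZ_STREAM_END is the decompressor's eof flag), any leftover bytes are fed to a fresh decompressor instead of being dropped.

```python
import bz2


def decompress_multistream(data: bytes) -> bytes:
    """Decompress one or more concatenated bzip2 streams.

    Hypothetical helper for illustration only.
    """
    chunks = []
    while data:
        # A BZ2Decompressor handles exactly one stream.
        decompressor = bz2.BZ2Decompressor()
        chunks.append(decompressor.decompress(data))
        if not decompressor.eof:
            # Input ended before the end-of-stream marker was seen.
            raise ValueError("truncated bzip2 stream")
        # Bytes after the end of this stream belong to the next stream.
        data = decompressor.unused_data
    return b"".join(chunks)


# Two independently compressed streams, concatenated the way a parallel
# compressor such as pbzip2 would lay them out.
multistream = bz2.compress(b"hello ") + bz2.compress(b"world")

# A single decompressor stops at the end of the first stream.
single = bz2.BZ2Decompressor()
first_only = single.decompress(multistream)

# Looping over the leftover input recovers the full payload.
full = decompress_multistream(multistream)
```

In this sketch first_only holds only the first stream's data, while full holds the data from both streams. The same pattern applies to zlib-style codecs, where the end of a stream is signalled by Z_STREAM_END.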