[ 
https://issues.apache.org/jira/browse/IMPALA-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-3073:
----------------------------------
    Issue Type: New Feature  (was: Bug)

> Verify if compressed data in avro, sequence file format could be multistream
> ----------------------------------------------------------------------------
>
>                 Key: IMPALA-3073
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3073
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Backend
>    Affects Versions: Impala 2.0
>            Reporter: Juan Yu
>            Priority: Major
>
> When generating compressed text files, certain tools (such as pbzip2) can 
> compress in parallel and produce multistream compressed data. We need to 
> verify whether this also applies to other file formats, such as Parquet, 
> Avro, and SequenceFile, that use the same codecs. If it does, 
> Codec::ProcessBlock() should support multistream compressed block data: 
> decompression should not stop when it reaches the end of a compressed 
> stream, e.g. Z_STREAM_END or BZ_STREAM_END.
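
To illustrate the behaviour requested above, here is a minimal sketch in Python (not Impala's C++ code, and not how Codec::ProcessBlock() is implemented). It uses the standard-library bz2 and zlib bindings, where the `eof` flag corresponds to BZ_STREAM_END / Z_STREAM_END. The helper name `decompress_all_streams` is hypothetical and exists only for this example. The idea is that reaching the end of one stream restarts the decompressor on the remaining bytes instead of ending decompression.

```python
import bz2
import gzip
import zlib


def decompress_all_streams(data, new_decompressor):
    """Decompress every concatenated stream in `data`.

    `new_decompressor` is a zero-argument callable returning a fresh
    decompressor object that exposes decompress(), eof and unused_data
    (both bz2.BZ2Decompressor and zlib decompress objects do).
    """
    chunks = []
    while data:
        d = new_decompressor()
        chunks.append(d.decompress(data))
        if not d.eof:
            # Input ended before the stream-end marker was seen.
            raise ValueError("truncated compressed stream")
        # End of one stream is not end of input: continue with the
        # bytes that follow the stream-end marker.
        data = d.unused_data
    return b"".join(chunks)


def new_gzip_decompressor():
    # 16 + MAX_WBITS tells zlib to expect a gzip header and trailer.
    return zlib.decompressobj(16 + zlib.MAX_WBITS)


# Concatenated streams, similar in shape to what pbzip2 produces.
bz2_blob = bz2.compress(b"hello ") + bz2.compress(b"world")
gzip_blob = gzip.compress(b"foo") + gzip.compress(b"bar")

# A single decompressor stops at the first stream end.
first_only = bz2.BZ2Decompressor().decompress(bz2_blob)

all_bz2 = decompress_all_streams(bz2_blob, bz2.BZ2Decompressor)
all_gzip = decompress_all_streams(gzip_blob, new_gzip_decompressor)
```

In this sketch `first_only` holds only the first stream's payload, while `all_bz2` and `all_gzip` hold the concatenation of every stream. Whether Avro, SequenceFile, or Parquet writers ever emit such blocks is exactly what this issue asks to verify.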



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)