[ 
https://issues.apache.org/jira/browse/HIVE-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1343:
---------------------------------

    Component/s: Serializers/Deserializers

> add an interface in RCFile to support concatenation of two files without 
> (de)compression
> ----------------------------------------------------------------------------------------
>
>                 Key: HIVE-1343
>                 URL: https://issues.apache.org/jira/browse/HIVE-1343
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>    Affects Versions: 0.6.0
>            Reporter: Ning Zhang
>            Assignee: He Yongqiang
>         Attachments: HIVE-1343.1.patch
>
>
> If two files are concatenated, we need to read each record in these files and 
> write them back to the destination file. The IO cost is mostly unavoidable 
> due to the lack of append functionality in HDFS. However the CPU cost could 
> be significantly reduced by avoiding compression and decompression of the 
> files.
> The File Format layer should provide API that implement the block-level 
> concatenation. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to