Kevin Wilfong created HIVE-3897:
-----------------------------------

             Summary: Add a way to get the uncompressed/compressed sizes of columns from an RC File
                 Key: HIVE-3897
                 URL: https://issues.apache.org/jira/browse/HIVE-3897
             Project: Hive
          Issue Type: New Feature
    Affects Versions: 0.11.0
            Reporter: Kevin Wilfong
            Assignee: Kevin Wilfong


The uncompressed and compressed sizes of each column of an RCFile are stored in 
the header of each RCFile block.  Currently, we have no convenient way to get at 
this data.  Exposing it would help identify where RCFile is doing a poor job of 
compression, so that we can better focus our efforts.

RCFileCat seems like a logical tool to extend for this.
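
As a rough illustration of the kind of report this could produce, here is a 
minimal sketch that totals per-column sizes across blocks and computes a 
compression ratio for each column.  It is standalone Python and does not use any 
Hive or RCFile API: the ColumnSizes class, the summarize function, and the sample 
byte counts are all hypothetical, and the input is assumed to be one list of 
(compressed, uncompressed) pairs per block, already read from the block headers.

```python
from dataclasses import dataclass
from typing import Iterable, List, Sequence, Tuple


@dataclass
class ColumnSizes:
    """Running byte totals for one column across all blocks (hypothetical type)."""
    compressed: int = 0
    uncompressed: int = 0

    @property
    def ratio(self) -> float:
        """Compressed size as a fraction of uncompressed size (lower is better)."""
        return self.compressed / self.uncompressed if self.uncompressed else 0.0


def summarize(blocks: Iterable[Sequence[Tuple[int, int]]]) -> List[ColumnSizes]:
    """Sum (compressed, uncompressed) byte counts per column over all blocks."""
    totals: List[ColumnSizes] = []
    for block in blocks:
        for i, (comp, uncomp) in enumerate(block):
            # Grow the totals list the first time a column index is seen.
            if i == len(totals):
                totals.append(ColumnSizes())
            totals[i].compressed += comp
            totals[i].uncompressed += uncomp
    return totals


# Made-up sizes for two blocks of a three-column file: (compressed, uncompressed).
blocks = [
    [(100, 400), (390, 400), (50, 1000)],
    [(120, 600), (410, 420), (70, 1000)],
]
totals = summarize(blocks)
for i, t in enumerate(totals):
    print(f"column {i}: compressed={t.compressed} "
          f"uncompressed={t.uncompressed} ratio={t.ratio:.2f}")
```

With these made-up numbers the second column has a ratio close to 1, which is 
the sort of column such a report would flag as compressing poorly.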

