Kevin Wilfong created HIVE-3897:
-----------------------------------
Summary: Add a way to get the uncompressed/compressed sizes of
columns from an RC File
Key: HIVE-3897
URL: https://issues.apache.org/jira/browse/HIVE-3897
Project: Hive
Issue Type: New Feature
Affects Versions: 0.11.0
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
The uncompressed, compressed size of each column of an RCFile is stored in the
header of an RCFile block. Currently, we have no convenient way to get at this
data. This would be useful for identifying where RCFile is doing a poor job of
compression, so that we can better focus our efforts.
RCFileCat seems like a logical tool to extend to add this.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira