Kevin Wilfong created HIVE-3897: ----------------------------------- Summary: Add a way to get the uncompressed/compressed sizes of columns from an RC File Key: HIVE-3897 URL: https://issues.apache.org/jira/browse/HIVE-3897 Project: Hive Issue Type: New Feature Affects Versions: 0.11.0 Reporter: Kevin Wilfong Assignee: Kevin Wilfong
The uncompressed, compressed size of each column of an RCFile is stored in the header of an RCFile block. Currently, we have no convenient way to get at this data. This would be useful for identifying where RCFile is doing a poor job of compression, so that we can better focus our efforts. RCFileCat seems like a logical tool to extend to add this. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira