[ https://issues.apache.org/jira/browse/HIVE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554126#comment-13554126 ]
Kevin Wilfong commented on HIVE-3897:
-------------------------------------

https://cwiki.apache.org/confluence/display/Hive/RCFileCat

> Add a way to get the uncompressed/compressed sizes of columns from an RC File
> -----------------------------------------------------------------------------
>
>                 Key: HIVE-3897
>                 URL: https://issues.apache.org/jira/browse/HIVE-3897
>             Project: Hive
>          Issue Type: New Feature
>    Affects Versions: 0.11.0
>            Reporter: Kevin Wilfong
>            Assignee: Kevin Wilfong
>             Fix For: 0.11.0
>
>         Attachments: HIVE-3897.1.patch.txt
>
>
> The uncompressed, compressed size of each column of an RCFile is stored in
> the header of an RCFile block. Currently, we have no convenient way to get
> at this data. This would be useful for identifying where RCFile is doing a
> poor job of compression, so that we can better focus our efforts.
> RCFileCat seems like a logical tool to extend to add this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira