[ https://issues.apache.org/jira/browse/ORC-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16399359#comment-16399359 ]
Sandeep More commented on ORC-305: ---------------------------------- Ah, thanks [~owen.omalley] for the details, makes it much clearer. > Add column statistics for the size on disk > ------------------------------------------ > > Key: ORC-305 > URL: https://issues.apache.org/jira/browse/ORC-305 > Project: ORC > Issue Type: Test > Reporter: Owen O'Malley > Assignee: Sandeep More > Priority: Major > > It would be great to have the size on disk of each column. > You can generate this by adding up the sizes of the dictionary and data > streams. > It is only relevant at the stripe and file level. -- This message was sent by Atlassian JIRA (v7.6.3#76005)