[
https://issues.apache.org/jira/browse/ORC-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439582#comment-16439582
]
Sandeep More commented on ORC-305:
----------------------------------
A quick update on my status, I am done with the code code and currently working
on fixing the tests.
BTW I am excluding the ROW_INDEX and BLOOM_FILTER_UTF8 from bytes on disk
calculation.
> Add column statistics for the size on disk
> ------------------------------------------
>
> Key: ORC-305
> URL: https://issues.apache.org/jira/browse/ORC-305
> Project: ORC
> Issue Type: Test
> Reporter: Owen O'Malley
> Assignee: Sandeep More
> Priority: Major
>
> It would be great to have the size on disk of each column.
> You can generate this by adding up the sizes of the dictionary and data
> streams.
> It is only relevant at the stripe and file level.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)