[
https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17760655#comment-17760655
]
ASF GitHub Bot commented on PARQUET-2261:
-----------------------------------------
emkornfield commented on PR #197:
URL: https://github.com/apache/parquet-format/pull/197#issuecomment-1700012396
@etseidl Thanks for prototyping. One more question: approximately how many
columns were in each file? (I'm trying to understand the average size increase
per column.) Without actually doing benchmarking, I'd guess that overall this
growth probably does not add too much overhead (if anything, it is likely
extra Thrift parsing rather than IO/memory).
@gszadovszky @wgtmac thoughts?
> [Format] Add statistics that reflect decoded size to metadata
> -------------------------------------------------------------
>
> Key: PARQUET-2261
> URL: https://issues.apache.org/jira/browse/PARQUET-2261
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-format
> Reporter: Micah Kornfield
> Assignee: Micah Kornfield
> Priority: Major
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)