[ 
https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17760655#comment-17760655
 ] 

ASF GitHub Bot commented on PARQUET-2261:
-----------------------------------------

emkornfield commented on PR #197:
URL: https://github.com/apache/parquet-format/pull/197#issuecomment-1700012396

   @etseidl Thanks for prototyping.  One more question, approximately how many 
columns where in each file (I'm trying to understand average size increase per 
column).  Without actually doing benchmarking, I'd guess overall this growth 
probably does not add too much overhead (I'd guess if anything it is likely 
extra thrift parsing) rather then IO/memory.
   
   @gszadovszky @wgtmac thoughts?




> [Format] Add statistics that reflect decoded size to metadata
> -------------------------------------------------------------
>
>                 Key: PARQUET-2261
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2261
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-format
>            Reporter: Micah Kornfield
>            Assignee: Micah Kornfield
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to