mapleFU commented on issue #34712:
URL: https://github.com/apache/arrow/issues/34712#issuecomment-1483864875

   The util is great, however, it's a bit-tricky here. I've  implement a 
similar size-hint in our system, here are some problems I met:
   
   1. Null variables. In Arrow Array, null-value should occupy some place, but
       field-raw size cannot represent that value.
   2. Size of FLBA/ByteArray. It's size should be variable-size-summary or
       variable-size-summary + sizeof(ByteArray) * value-count
   3. Some time Arrow data is not equal to Parquet data, like Decimal stored
       as int32 or int64.
   
   Hope that helps.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to