alamb opened a new issue, #7820:
URL: https://github.com/apache/arrow-rs/issues/7820

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   - part of https://github.com/apache/arrow-rs/issues/6736
   
   The 
[Variant](https://github.com/apache/parquet-format/blob/master/VariantEncoding.md#encoding-types)
 encoding uses different sizes for offsets in nested types to optimize the 
encoding size
   
   Specifically
   * Arrays (`VariantList`) [use 1 byte and 4 bytes for length when the list 
has <=256 and >256 list elements 
respectively](https://github.com/apache/parquet-format/blob/master/VariantEncoding.md#value-header-for-array-basic_type3)
   * Arrays (`VariantList`) [use 1 byte and 4 bytes for the offset depending on 
how large the total variant payload size 
is](https://github.com/apache/parquet-format/blob/master/VariantEncoding.md#value-header-for-array-basic_type3)
   * Objects 
[](https://github.com/apache/parquet-format/blob/master/VariantEncoding.md#value-header-for-object-basic_type2)
   
   
   **Describe the solution you'd like**
   
   I would like tests that use the `VariantBuilder` API and cover the following 
cases:
   1. `VariantList` with more than 256 elements
   2. `VariantList` with total child data length between 2^8 and 2^16 ( 
`field_offset_size_minus_1` = 1, 2 byte offsets)
   2. `VariantList` with total child data length between 2^16 and 2^24 ( 
`field_offset_size_minus_1` = 2, 3 byte offsets)
   2. `VariantList` with total child data length between 2^24 and 2^32 ( 
`field_offset_size_minus_1` = 3, 4 byte offsets)
   
   **Describe alternatives you've considered**
   <!--
   A clear and concise description of any alternative solutions or features 
you've considered.
   -->
   
   **Additional context**
   <!--
   Add any other context or screenshots about the feature request here.
   -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to