alamb commented on issue #7715:
URL: https://github.com/apache/arrow-rs/issues/7715#issuecomment-3062701191

   
   > > 
   > > I also agree with this [@scovich](https://github.com/scovich) -- 
however, I am not quite sure what the API would look like yet so I am not sure 
yet what ticket to file
   > 
   > The public API seems simple enough? A shredded variant column would 
(physically) be a `StructArray` with `typed_value` alongside its `metadata` and 
`value` fields. I would expect an `unshred_variant` kernel to take such an 
input, and produce an output that does _not_ have a `typed_value` column any 
more. The [shredding 
spec](https://github.com/apache/parquet-format/blob/master/VariantShredding.md#variant-metadata)
 requires that the `metadata` column already contain every needed variant path 
name, so it's really just a matter of rewriting the `value` column under the 
hood.
   
   
   This makes sense to me. After some sleep and coding, I think it would be a 
natural follow on when we have `VariantArray` that encapsulates the shredding
   - https://github.com/apache/arrow-rs/pull/7905
   
   Basically a kernel like 
   ```rust
   /// Unshreds Variant
   fn unshred_variant(input: VariantArray) -> VariantArray { ... }
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to