tschaub commented on issue #37807:
URL: https://github.com/apache/arrow/issues/37807#issuecomment-1747611641

   Looks useful, @chelseajonesr.
   
   My only real current use case has been to create Parquet data for tests.  
I've written a [`test.ParquetFromJSON()` 
function](https://github.com/planetlabs/gpq/blob/v0.19.0/internal/test/test.go#L24-L54)
 for this purpose.  Maybe also specific to my use case, but this relies on 
incrementally building up a schema based on a configurable number of input 
(JSON) rows - to allow for cases where `nulls` may be present in early rows and 
the appropriate field type isn't known until reading more data.  So I have an 
[Arrow schema 
builder](https://github.com/planetlabs/gpq/blob/v0.19.0/internal/pqutil/arrow.go)
 for this.  This doesn't yet cover all the types you might encounter with an 
arbitrary struct - I'm just adding support for the cases I need to handle.
   
   So while I think it could be useful to have something in this library to 
generate Arrow data from a slice of structs (to complement the current 
`parquet.NewSchemaFromStruct()` function), I just wanted to say that I don't 
have an urgent need for this now.  I'll close this unless someone else thinks 
it is a worthwhile issue to keep open.


