AlenkaF commented on issue #20107:
URL: https://github.com/apache/arrow/issues/20107#issuecomment-4405040235

   Is this still relevant and should be kept open or we can close this issue 
acknowledging:
   
   - Arrow C++ and PyArrow allow arbitrary binary data in metadata values (not 
necessarily valid UTF-8)
   - PyArrow's `ExtensionType ` storing extension metadata in binary (pickle 
dump) which can break some readers, for example. `arrow-rs`code, which enforce 
`UTF-8`. In this case one can use `ExtensionType` instead of `PyExtensionType` 
and implement `__arrow_ext_serialize__` returning valid UTF-8 bytes?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to