Hi all,

I wanted to make sure I understood the distinction/use cases for choosing 
between the utf8 and binary logical types.

Based on this doc 
<https://arrow.apache.org/docs/format/Metadata.html#utf8-and-binary>

* Utf8 data is Unicode values with UTF-8 encoding
* Binary is any other variable length bytes

I wonder what is the correct way to consume a binary array.  It seems like a 
binary array is likely representing some string data but without the encoding 
it isn't not clear how to safely interpret it.  Is there a convention (e.g. 
assume a binary type is ASCII encoded) that we can follow?

Many thanks,

Hatem

Reply via email to