One possibility is adding type annotations sort of in the style of the
Parquet format. So these would run parallel to the types in
https://github.com/apache/arrow/blob/master/format/Schema.fbs#L194

On Wed, Feb 21, 2018 at 11:53 AM, Jacques Nadeau <jacq...@apache.org> wrote:
> I think we should consider at introducing these "business" types
> differently. Same could be said for a US zipcode type, for example.
>
> On Thu, Feb 15, 2018 at 6:36 PM, Wes McKinney <wesmck...@gmail.com> wrote:
>
>> hi Uwe,
>>
>> This seems like a good idea to me given the widespread use of UUIDs,
>> and would make use more natural for application developers.
>>
>> - Wes
>>
>> On Tue, Feb 13, 2018 at 10:03 AM, Uwe L. Korn <uw...@xhochy.com> wrote:
>> > Hello,
>> >
>> > I just opened https://issues.apache.org/jira/browse/ARROW-2152 to start
>> the discussion about adding a UUID type to the Arrow format specification.
>> In its essence a UUID is simply a 128bit array but there are often special
>> classes used for it, e.g. java.util.UUID in Java and uuid.UUID in Python.
>> These provide special functions for them as well as sometimes the knowledge
>> that a column is a UUID could be beneficial during computations. Other data
>> systems like Postgres or Parquet also have a special UUID type.
>> >
>> > While there is only a small difference to a 128bit fixed sized binary
>> array, I think providing the respective object model accessor is already a
>> good benefit.
>> >
>> > Uwe
>>

Reply via email to