[ 
https://issues.apache.org/jira/browse/PARQUET-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854720#comment-15854720
 ] 

Julian Hyde commented on PARQUET-686:
-------------------------------------

If you're following the relevant industry standard, you're not being
SQL-specific. I don't know whether there is an ISO standard, but even if there
isn't, we'd be well advised to follow what the big DBs do. The folks at SQL
Server and Oracle have had more time to identify the appropriate standard to
follow. If we roll our own standard we are very likely to come up with
something worse.

> Allow for Unsigned Statistics in Binary Type
> --------------------------------------------
>
>                 Key: PARQUET-686
>                 URL: https://issues.apache.org/jira/browse/PARQUET-686
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Andrew Duffy
>
> BinaryStatistics currently has only a min/max, which are compared as signed
> {{byte[]}} values. However, for real UTF8-friendly lexicographic comparison,
> e.g. for string columns, we would want to calculate the BinaryStatistics
> using a comparator that treats the bytes as unsigned.
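
To illustrate the difference the issue describes, here is a minimal Java
sketch. It is not Parquet's actual code: the class and both helper methods
(compareSigned, compareUnsigned) are hypothetical names made up for this
example. It compares the UTF-8 bytes of "z" (0x7A) and "é" (0xC3 0xA9) both
ways. Every byte of a multi-byte UTF-8 sequence has its high bit set, so as a
signed Java byte it is negative and sorts below all ASCII characters.

```java
import java.nio.charset.StandardCharsets;

class UnsignedBytesDemo {
    // Hypothetical helper: lexicographic order using Java's default
    // signed byte semantics (values -128..127).
    static int compareSigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            if (a[i] != b[i]) {
                return Byte.compare(a[i], b[i]);
            }
        }
        // Equal on the common prefix: the shorter array sorts first.
        return Integer.compare(a.length, b.length);
    }

    // Hypothetical helper: lexicographic order treating each byte as
    // unsigned (values 0..255), which matches Unicode code point order
    // for valid UTF-8.
    static int compareUnsigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int x = a[i] & 0xFF;
            int y = b[i] & 0xFF;
            if (x != y) {
                return Integer.compare(x, y);
            }
        }
        return Integer.compare(a.length, b.length);
    }

    public static void main(String[] args) {
        byte[] z = "z".getBytes(StandardCharsets.UTF_8);       // 0x7A
        byte[] e = "\u00e9".getBytes(StandardCharsets.UTF_8);  // 0xC3 0xA9

        // Signed: 0xC3 is -61, so "é" sorts before "z".
        System.out.println("signed:   " + Integer.signum(compareSigned(z, e)));
        // Unsigned: 0xC3 is 195, so "z" sorts before "é".
        System.out.println("unsigned: " + Integer.signum(compareUnsigned(z, e)));
    }
}
```

With a signed comparison, a column holding both values would record "é" as
its min and "z" as its max, so a filter relying on string order could draw
the wrong conclusion from the statistics. The unsigned comparison gives the
order a string comparison by code point would expect.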



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
