[ https://issues.apache.org/jira/browse/PARQUET-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854720#comment-15854720 ]
Julian Hyde commented on PARQUET-686: ------------------------------------- If you're following the relevant industry standard you're not being SQL specific. I don't know whether there is an ISO standard, but even if there isn't, we'd be well advised to follow what the big DBs do. The folks at SQL Server and Oracle have had more time to identify the appropriate standard to follow. If we roll our own standard we are very likely to come up with something worse. > Allow for Unsigned Statistics in Binary Type > -------------------------------------------- > > Key: PARQUET-686 > URL: https://issues.apache.org/jira/browse/PARQUET-686 > Project: Parquet > Issue Type: Bug > Reporter: Andrew Duffy > > BinaryStatistics currently only have a min/max, which are compared as signed > {{byte[]}}. However, for real UTF8-friendly lexicographic comparison, e.g. > for string columns, we would want to calculate the BinaryStatistics based off > of a comparator that treats the bytes as unsigned. -- This message was sent by Atlassian JIRA (v6.3.15#6346)