raunaqmorarka commented on code in PR #216:
URL: https://github.com/apache/parquet-format/pull/216#discussion_r1333328796
##########
src/main/thrift/parquet.thrift:
##########
@@ -216,7 +216,12 @@ struct Statistics {
/** count of distinct values occurring */
4: optional i64 distinct_count;
/**
- * Min and max values for the column, determined by its ColumnOrder.
+ * lower and upper bound values for the column, determined by its
ColumnOrder.
Review Comment:
Given the feature in https://issues.apache.org/jira/browse/PARQUET-1685, I
want to assume that all existing stats are truncated. Going forward we should
have a flag to explicitly indicate whether or not truncation took place and
applications should perform aggregation pushdown only if that flag is found to
indicate no truncation. But I think adding that flag can be tackled separately
as a follow-up.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]