Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/595#discussion_r81625440
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -935,6 +972,11 @@ public ColumnTypeMetadata_v2
getColumnTypeInfo(String[] name) {
@JsonIgnore @Override public ParquetTableMetadataBase clone() {
return new ParquetTableMetadata_v2(files, directories,
columnTypeInfo);
}
+
+ @JsonIgnore @Override public boolean isDateCorrect() {
+ return isDateCorrect;
--- End diff --
If a metadata cache file exists, Drill reads it instead of retrieving
metadata from the individual Parquet files. If the cache file was generated
by Drill after this commit, the value of isDateCorrect will be true. If it was
generated by Drill before this commit, the isDateCorrect field will be absent
from the metadata cache file, so its value in ParquetTableMetadata_v2 will be
false.
Based on this value we simply set DateCorruptionStatus (see
ParquetReaderUtility.correctDatesInMetadataCache() for details). The rest of
the data checking in the cache was not changed.
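
To illustrate the rule described above, here is a minimal sketch that does not use Drill's classes or its JSON deserialization. It models a parsed cache file as a plain map and treats a missing isDateCorrect entry as false. The class name, the Status enum and its values, and the "metadata_version" key are all hypothetical and only stand in for the real DateCorruptionStatus handling.

```java
import java.util.Map;

// Hypothetical stand-in for the cache handling described above; not Drill's API.
class DateCorrectSketch {

    // Illustrative statuses only; the real DateCorruptionStatus values may differ.
    enum Status { DATES_CORRECT, DATES_MAY_BE_CORRUPT }

    // Models a parsed cache file as a map of its top-level fields.
    // A missing "isDateCorrect" entry is treated as false, matching the
    // behaviour described for cache files written before the commit.
    static boolean isDateCorrect(Map<String, Object> cacheFields) {
        Object value = cacheFields.get("isDateCorrect");
        return value instanceof Boolean && (Boolean) value;
    }

    // Derives the status from the flag alone.
    static Status statusFor(Map<String, Object> cacheFields) {
        return isDateCorrect(cacheFields)
                ? Status.DATES_CORRECT
                : Status.DATES_MAY_BE_CORRUPT;
    }

    public static void main(String[] args) {
        // Cache written before the commit: the field is absent.
        Map<String, Object> oldCache = Map.of("metadata_version", "v2");
        // Cache written after the commit: the field is present and true.
        Map<String, Object> newCache =
                Map.of("metadata_version", "v2", "isDateCorrect", true);

        System.out.println(statusFor(oldCache));
        System.out.println(statusFor(newCache));
    }
}
```

The point of the sketch is that no explicit version check is needed: the absence of the field is itself the signal that the cache predates the fix.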