linliu-code commented on issue #18754: URL: https://github.com/apache/hudi/issues/18754#issuecomment-4498669199
PR up for review: https://github.com/apache/hudi/pull/18792 Addresses both Effect 1 (corrupted MDT col-stats writer) and Effect 2 (silent wrong results at read time) via writer-side detection of parquet's `hasNonNullValue=false` signal, unreliability propagation through the partition-stats aggregator, and null-safety wrapping on the data-skipping predicate translator. Note: the `s_str LIKE 'a%'` symptom in the original repro turned out to be a **separate pre-existing bug** in the StartsWith predicate translation, unrelated to NaN. Filed as #18791. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
