adriangb opened a new issue, #14053: URL: https://github.com/apache/datafusion/issues/14053
Follow up to #507.

Predicate pruning is a powerful technique to speed up queries by skipping entire files / pieces of work based on summary statistics of the data. This issue proposes implementing predicate pruning for `NOT LIKE` expressions.

I'm not sure how much can be done here; I'm guessing not much. The one case I can think of is `NOT LIKE 'prefix%'` when the min/max stats are, say, `prefixa` and `prefixb` respectively. In this case we _know_ that every row will match the `LIKE` expression, so no row can satisfy `NOT LIKE` and the whole container can be skipped.

To implement this you'll need to make a PR similar to https://github.com/apache/datafusion/pull/12978 and add fuzz tests (see https://github.com/apache/datafusion/pull/13253).
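To make the `prefix%` case concrete, here is a minimal sketch of the check in plain Rust. It is not DataFusion's pruning API: `literal_prefix` and `can_skip_not_like` are hypothetical names, the min/max statistics are assumed to be plain strings ordered byte-wise, and only patterns consisting of a literal prefix followed by a single trailing `%` are handled. Anything else conservatively keeps the container.

```rust
/// Hypothetical helper: returns the literal prefix when `pattern` has the
/// form `<literal>%`, with no other wildcards (`%`, `_`) and no escape
/// characters in the literal part. Returns `None` for any other pattern.
fn literal_prefix(pattern: &str) -> Option<&str> {
    let prefix = pattern.strip_suffix('%')?;
    if prefix.chars().any(|c| matches!(c, '%' | '_' | '\\')) {
        return None;
    }
    Some(prefix)
}

/// Hypothetical check: returns true when a container with the given min/max
/// statistics cannot contain any row satisfying `col NOT LIKE pattern`.
///
/// If both `min` and `max` start with the prefix, every value between them
/// (in byte-wise order) also starts with it, so every non-null row matches
/// `LIKE` and none matches `NOT LIKE`. Null rows evaluate to NULL and are
/// filtered out either way.
fn can_skip_not_like(min: &str, max: &str, pattern: &str) -> bool {
    match literal_prefix(pattern) {
        Some(prefix) => min.starts_with(prefix) && max.starts_with(prefix),
        // Unsupported pattern shape: never prune.
        None => false,
    }
}
```

With the stats from the example, `can_skip_not_like("prefixa", "prefixb", "prefix%")` returns `true`, while a minimum of `prefa` or a pattern such as `prefix%x` returns `false` and the container is kept.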
