adriangb opened a new issue, #14053:
URL: https://github.com/apache/datafusion/issues/14053

   Follow up to #507.
   
   Predicate pruning is a powerful technique to speed up queries by skipping 
entire files / pieces of work based on summary statistics of the data.
   
   This issue proposes implementing predicate pruning for `NOT LIKE` 
expressions.
   
   I'm not sure how much can be done here, I'm guessing not much. The one case 
I can think of is `NOT LIKE 'prefix%' in the case that the min/max stats are 
`prefixa` and `prefixb` respectively or something. In this case we _know_ that 
every row will match the LIKE expression.
   
   To implement this you'll need to make a PR similar to 
https://github.com/apache/datafusion/pull/12978 and add fuzz tests (see 
https://github.com/apache/datafusion/pull/13253)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to