[I] Support predicate pruning for NOT LIKE expressions [datafusion]

via GitHub Thu, 09 Jan 2025 04:55:17 -0800


adriangb opened a new issue, #14053:
URL: https://github.com/apache/datafusion/issues/14053


   Follow up to #507.
   
   Predicate pruning is a powerful technique to speed up queries by skipping 
entire files / pieces of work based on summary statistics of the data.
   
   This issue proposes implementing predicate pruning for `NOT LIKE` 
expressions.
   
   I'm not sure how much can be done here, I'm guessing not much. The one case 
I can think of is `NOT LIKE 'prefix%' in the case that the min/max stats are 
`prefixa` and `prefixb` respectively or something. In this case we _know_ that 
every row will match the LIKE expression.
   
   To implement this you'll need to make a PR similar to 
https://github.com/apache/datafusion/pull/12978 and add fuzz tests (see 
https://github.com/apache/datafusion/pull/13253)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[I] Support predicate pruning for NOT LIKE expressions [datafusion]

Reply via email to