Re: [PR] Make file prefetch configurable [datafusion]

via GitHub Sat, 18 Oct 2025 15:59:56 -0700


adriangb commented on PR #17758:
URL: https://github.com/apache/datafusion/pull/17758#issuecomment-3376814844


   From my testing this works, but it does not seem to improve query 
performance. Maybe something is wrong with the implementation, but I think the 
fundamental problem is that you want to go wide with IO but not with the other 
parts (e.g. the CPU work of applying filters in predicate pushdown), and this 
is too high a level for that. @alamb is working on changes to Parquet decoding 
that will allow prefetching at the byte-range level, which sounds much more 
appropriate since we will be able to do IO-only prefetching. I will continue to 
investigate, but I am going to close this for now to keep the PR backlog down.
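
To illustrate the distinction, here is a minimal sketch of IO-only prefetching, written in Python for brevity. It is not DataFusion or Parquet reader code: `fetch_range`, `decode_and_filter`, `scan`, and the `prefetch` parameter are hypothetical names, and the "store" is just an in-memory byte string. Range requests are issued concurrently, up to a limit, while the CPU-side decode and filter step still runs one chunk at a time.

```python
import asyncio


async def fetch_range(store, start, end):
    # Hypothetical stand-in for an object-store byte-range request (pure IO).
    await asyncio.sleep(0)
    return store[start:end]


def decode_and_filter(chunk):
    # Hypothetical stand-in for CPU work (decoding plus a pushed-down filter).
    # Here it simply keeps the even byte values.
    return [b for b in chunk if b % 2 == 0]


async def scan(store, ranges, prefetch=4):
    # At most `prefetch` range requests are in flight at any time.
    sem = asyncio.Semaphore(prefetch)

    async def bounded_fetch(start, end):
        async with sem:
            return await fetch_range(store, start, end)

    # Go wide on IO: every range is scheduled up front.
    tasks = [asyncio.create_task(bounded_fetch(s, e)) for s, e in ranges]

    # Stay narrow on CPU: chunks are decoded sequentially, in range order.
    out = []
    for task in tasks:
        out.extend(decode_and_filter(await task))
    return out


def run_example():
    store = bytes(range(16))
    ranges = [(0, 4), (4, 8), (8, 12), (12, 16)]
    return asyncio.run(scan(store, ranges, prefetch=2))
```

The sketch buffers every fetched chunk until it is consumed; a real implementation would presumably also bound the amount of prefetched data held in memory.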


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

