adriangb commented on PR #18307: URL: https://github.com/apache/datafusion/pull/18307#issuecomment-3453344215
> I think I've figured out how to make the bloom filters very, very cheap to build: re-use the hashes calculated for the hash table so that the only thing we ever insert into the bloom filter are `u64`s. I've implemented this. I think it could be improved further using the `CASE ... ` structure to avoid checking all partitions zone map / bloom filters. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
