adriangb commented on issue #17171: URL: https://github.com/apache/datafusion/issues/17171#issuecomment-3280251117
I haven't run the benchmarks because we don't have an implementation yet, but I suspect that even in moderate cases the bloom filter might be faster than the hash table. At least that's what the sources cited in https://github.com/apache/datafusion/issues/17171#issuecomment-3264518297 and https://github.com/apache/datafusion/issues/17171#issuecomment-3266105128 seem to suggest.