alamb opened a new issue, #8767: URL: https://github.com/apache/arrow-datafusion/issues/8767
### Is your feature request related to a problem or challenge? It appears that there is no good way to know if the bloom filter code is working via logging or metrics 🤔 We have metrics on ParquetExec that explain when row groups are pruned, but now that we can prune them via statistics or bloom filters, we can't tell which was used in a particular query Among other things this meant it was hard to reproduce / diagnose issues like https://github.com/apache/arrow-datafusion/issues/8685 ### Describe the solution you'd like Add some metrics for parquet pruning ### Describe alternatives you've considered @my-vegetable-has-exploded has a branch that could be a good starting point: I check it by the metric I add myself(like https://github.com/apache/arrow-datafusion/compare/main...my-vegetable-has-exploded:arrow-datafusion:metric-sbbf?expand=1) , Should ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
