alamb opened a new issue, #8767:
URL: https://github.com/apache/arrow-datafusion/issues/8767

   ### Is your feature request related to a problem or challenge?
   
   It appears that there is no good way to tell, via logging or metrics, 
whether the bloom filter code is working 🤔
   
   We have metrics on ParquetExec that report when row groups are pruned, but 
now that row groups can be pruned via either statistics or bloom filters, we 
can't tell which mechanism was used in a particular query.
   
   Among other things this meant it was hard to reproduce / diagnose issues 
like https://github.com/apache/arrow-datafusion/issues/8685
   
   ### Describe the solution you'd like
   
   Add metrics for parquet row group pruning that report pruning by statistics 
and pruning by bloom filters separately.
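   
   As a rough illustration of the idea (not the actual DataFusion 
implementation), the sketch below keeps one counter per pruning mechanism so 
that a query's metrics show which one eliminated each row group. Every name 
here (`RowGroupPruningMetrics`, `PruneReason`, `prune_row_groups`) is 
hypothetical and uses only the Rust standard library; the real change would 
presumably hook into the existing ParquetExec metrics instead.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

/// Hypothetical pruning counters, one per mechanism.
/// These are illustrative names, not DataFusion's actual metric names.
#[derive(Debug, Default)]
pub struct RowGroupPruningMetrics {
    /// Row groups skipped because min/max statistics ruled them out.
    pruned_by_statistics: AtomicUsize,
    /// Row groups skipped because a bloom filter ruled them out.
    pruned_by_bloom_filter: AtomicUsize,
}

/// Which mechanism eliminated a row group.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum PruneReason {
    Statistics,
    BloomFilter,
}

impl RowGroupPruningMetrics {
    /// Increment the counter that matches the pruning mechanism.
    pub fn record(&self, reason: PruneReason) {
        let counter = match reason {
            PruneReason::Statistics => &self.pruned_by_statistics,
            PruneReason::BloomFilter => &self.pruned_by_bloom_filter,
        };
        counter.fetch_add(1, Ordering::Relaxed);
    }

    pub fn pruned_by_statistics(&self) -> usize {
        self.pruned_by_statistics.load(Ordering::Relaxed)
    }

    pub fn pruned_by_bloom_filter(&self) -> usize {
        self.pruned_by_bloom_filter.load(Ordering::Relaxed)
    }

    /// Total pruned row groups, which is all a single combined metric can show.
    pub fn total_pruned(&self) -> usize {
        self.pruned_by_statistics() + self.pruned_by_bloom_filter()
    }
}

/// Given a per-row-group pruning outcome (`None` means the row group must be
/// read), record why each pruned row group was skipped and return the indices
/// of the row groups that survive.
pub fn prune_row_groups(
    outcomes: &[Option<PruneReason>],
    metrics: &RowGroupPruningMetrics,
) -> Vec<usize> {
    let mut kept = Vec::new();
    for (idx, outcome) in outcomes.iter().enumerate() {
        match outcome {
            Some(reason) => metrics.record(*reason),
            None => kept.push(idx),
        }
    }
    kept
}
```

   With counters split this way, a query in which the bloom filter path never 
fires would show a zero for the bloom filter counter rather than being hidden 
inside a combined total.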
   
   ### Describe alternatives you've considered
   
   @my-vegetable-has-exploded has a branch that could be a good starting point:
    I check it by the metric I add myself (like 
https://github.com/apache/arrow-datafusion/compare/main...my-vegetable-has-exploded:arrow-datafusion:metric-sbbf?expand=1)
 , Should 
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.