2010YOUY01 opened a new issue, #17027:
URL: https://github.com/apache/datafusion/issues/17027

   ### Is your feature request related to a problem or challenge?
   
   `output_batches` should be a common metric in all operators. If we include 
it into `BaselineMetrics`, this can be updated automatically through 
`record_poll()` utility:
   
   
https://github.com/2010YOUY01/arrow-datafusion/blob/main/datafusion/physical-plan/src/joins/hash_join.rs#L1508
   
   
https://github.com/2010YOUY01/arrow-datafusion/blob/main/datafusion/physical-plan/src/metrics/baseline.rs#L124
   
   However, now it is only included in the join metrics, and join operators are 
updating this `output_batches` metrics manually.
   
   ### Describe the solution you'd like
   
   1. Include `output_batches` metric into `BaselineMetrics`
   2. Update `output_batches` inside `record_poll()`
   3. Remove all manual `output_batches` updates inside join operators. (by 
searching for all occurrences of 'output_batches' in the codebase).
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to