HuangZhenQiu opened a new pull request, #18790:
URL: https://github.com/apache/hudi/pull/18790

   ### Describe the issue this Pull Request addresses
   
   Add shard-assignment metrics to the bucket assign function. The 
GlobalRecordIndexPartitioner assigns a portion of the RLI shards to each bucket 
assign task so that no two tasks access the same RLI shard. The new metric 
reports how many shards are assigned to each task, which lets us detect skew 
when the assignment is uneven.
   
   Close #18732 
   
   ### Summary and Changelog
   
     1. GlobalRecordIndexPartitioner: Expose the file group count computation 
as a static package-private method and add a computeNumShardsAssigned helper.
     2. FlinkBucketAssignMetrics: Add a numShardsAssigned gauge backed by an 
AtomicInteger.
     3. BucketAssignFunction.open(): When global RLI is active, compute the 
shard count for this task and push it to metrics.
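
For illustration only, here is a minimal sketch of the idea behind the three changes above. It is not the code in this PR. It assumes shards are spread round-robin over tasks (shard `i` goes to task `i % parallelism`), which may differ from the actual mapping in GlobalRecordIndexPartitioner. The class `ShardAssignSketch`, the nested `Metrics` class, and all parameter names are hypothetical, and a plain `Supplier<Integer>` stands in for the Flink gauge.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

public class ShardAssignSketch {

  // Hypothetical helper: number of shards owned by taskIndex when shard i
  // is assigned to task (i % parallelism). The real partitioner may differ.
  static int computeNumShardsAssigned(int numShards, int parallelism, int taskIndex) {
    if (numShards < 0 || parallelism <= 0 || taskIndex < 0 || taskIndex >= parallelism) {
      throw new IllegalArgumentException("invalid shard assignment arguments");
    }
    int base = numShards / parallelism;      // shards every task receives
    int remainder = numShards % parallelism; // first `remainder` tasks get one extra
    return base + (taskIndex < remainder ? 1 : 0);
  }

  // Hypothetical stand-in for the metrics class: a gauge backed by an AtomicInteger.
  static final class Metrics {
    private final AtomicInteger numShardsAssigned = new AtomicInteger(0);

    // A metrics reporter would poll this supplier for the current value.
    Supplier<Integer> numShardsAssignedGauge() {
      return numShardsAssigned::get;
    }

    void setNumShardsAssigned(int value) {
      numShardsAssigned.set(value);
    }
  }

  public static void main(String[] args) {
    int numShards = 10;
    int parallelism = 4;
    // Mimics what each task would do once when it opens.
    for (int task = 0; task < parallelism; task++) {
      Metrics metrics = new Metrics();
      metrics.setNumShardsAssigned(computeNumShardsAssigned(numShards, parallelism, task));
      System.out.println("task " + task + " -> " + metrics.numShardsAssignedGauge().get());
    }
  }
}
```

Under this assumption the per-task counts differ by at most one, so a larger spread in the reported gauge values would point to skew in the real assignment.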
   
   ### Impact
   
   none
   
   ### Risk Level
   
   none
   
   
   ### Documentation Update
   
   none
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Enough context is provided in the sections above
   - [ ] Adequate tests were added if applicable
   

