cshuo opened a new pull request, #18716:
URL: https://github.com/apache/hudi/pull/18716
### Describe the issue this Pull Request addresses
Flink Source V2 builds its `FileIndex` from `HoodieScanContext`, but the
scan context did not carry the partition-to-bucket lookup needed by bucket
index pruning. As a result, bucket pruning information produced during filter
pushdown in `HoodieTableSource` could not be applied when Source V2 created
batch splits.
### Summary and Changelog
- Added `partitionBucketIdFunc` to `HoodieScanContext` so Source V2 scan
planning can carry bucket pruning state.
### Impact
* Enables bucket pruning for Flink Source V2 batch split generation when
reading bucket-index tables.
### Risk Level
low
<!-- Accepted values: none, low, medium or high. Other than `none`, explain
the risk.
If medium or high, explain what verification was done to mitigate the
risks. -->
### Documentation Update
<!-- Describe any necessary documentation update if there is any new
feature, config, or user-facing change. If not, put "none".
- The config description must be updated if new configs are added or the
default value of the configs are changed.
- Any new feature or user-facing change requires updating the Hudi website.
Please follow the
[instruction](https://hudi.apache.org/contribute/developer-setup#website)
to make changes to the website. -->
### Contributor's checklist
- [ ] Read through [contributor's
guide](https://hudi.apache.org/contribute/how-to-contribute)
- [ ] Enough context is provided in the sections above
- [ ] Adequate tests were added if applicable
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]