puneetdixit200 opened a new pull request, #18902: URL: https://github.com/apache/hudi/pull/18902
### Describe the issue this Pull Request addresses Closes #18758. ### Summary and Changelog This adds column range stats support for Lance base files. - Implement `LanceUtils.readColumnStatsFromMetadata` by reading projected Lance columns and collecting Hudi column range metadata from Spark records. - Route `.lance` base files through `HoodieTableMetadataUtil.readColumnRangeMetadataFrom`, so column stats and partition stats generation can use Lance files. - Add a Spark Lance reader test covering min, max, null count, value count, and ignored missing columns. ### Impact Enables metadata column stats and partition stats generation for Lance base files. No new public config or API. ### Risk Level Low. The change is scoped to Lance file-format stats reads and uses the existing Hudi column range metadata collector. Local verification: - `git diff --check` - `JAVA_HOME=/opt/homebrew/Cellar/openjdk@17/17.0.19/libexec/openjdk.jdk/Contents/Home PATH=/opt/homebrew/Cellar/openjdk@17/17.0.19/libexec/openjdk.jdk/Contents/Home/bin:$PATH mvn -pl hudi-common -am -DskipTests compile` ### Documentation Update None. ### Contributor's checklist - [x] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [x] Enough context is provided in the sections above - [x] Adequate tests were added if applicable -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
