puneetdixit200 opened a new pull request, #18902:
URL: https://github.com/apache/hudi/pull/18902

   ### Describe the issue this Pull Request addresses
   
   Closes #18758.
   
   ### Summary and Changelog
   
   This adds column range stats support for Lance base files.
   
   - Implement `LanceUtils.readColumnStatsFromMetadata` by reading projected 
Lance columns and collecting Hudi column range metadata from Spark records.
   - Route `.lance` base files through 
`HoodieTableMetadataUtil.readColumnRangeMetadataFrom`, so column stats and 
partition stats generation can use Lance files.
   - Add a Spark Lance reader test covering min, max, null count, value count, 
and ignored missing columns.
   
   ### Impact
   
   Enables metadata column stats and partition stats generation for Lance base 
files. No new public config or API.
   
   ### Risk Level
   
   Low. The change is scoped to Lance file-format stats reads and uses the 
existing Hudi column range metadata collector.
   
   Local verification:
   
   - `git diff --check`
   - 
`JAVA_HOME=/opt/homebrew/Cellar/openjdk@17/17.0.19/libexec/openjdk.jdk/Contents/Home
 
PATH=/opt/homebrew/Cellar/openjdk@17/17.0.19/libexec/openjdk.jdk/Contents/Home/bin:$PATH
 mvn -pl hudi-common -am -DskipTests compile`
   
   ### Documentation Update
   
   None.
   
   ### Contributor's checklist
   
   - [x] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Enough context is provided in the sections above
   - [x] Adequate tests were added if applicable
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to