baibaichen opened a new pull request, #9192:
URL: https://github.com/apache/incubator-gluten/pull/9192

   ## What changes were proposed in this pull request?
   (Fixes: #8846)
   
   ### Summary of Changes
   This commit primarily:
   1. Fixes directory name capitalization from "iceberg" to "Iceberg" across the codebase
   2. Reorganizes and improves include statements in several files
   3. Adds new benchmark tests for Iceberg with delete operations
   4. Makes minor code improvements and bug fixes
   
   ### Key Changes
   #### Directory Structure and Naming
   - Renamed `iceberg` directory to `Iceberg` with proper capitalization
   - Updated all include paths to reflect this change
   - Updated CMakeLists.txt to use the new capitalized directory name
   
   #### Code Improvements
   - Fixed typo: `begion` → `begin` in BlockTypeUtils.h
   - Added assertion to verify `input_format` is not null in the `NormalFileReader` constructor
   - Improved readability of `DeltaReader::create` by splitting a long line
   - Simplified row counting in ParquetFormatFile.cpp using `std::ranges::fold_left`
   - Added null check before using input in IcebergReader::create
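
As an illustration of the row-counting item above, here is a minimal sketch of summing per-row-group row counts with `std::ranges::fold_left`. It is not the code from ParquetFormatFile.cpp: `RowGroupInfo` and `totalRows` are hypothetical names invented for this example. Because `std::ranges::fold_left` is a C++23 library feature, the sketch falls back to `std::accumulate` when the standard library does not provide it.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>
#include <version>

// Hypothetical stand-in for per-row-group metadata; not a Gluten or Parquet type.
struct RowGroupInfo
{
    std::size_t num_rows;
};

// Returns the total number of rows across all row groups.
std::size_t totalRows(const std::vector<RowGroupInfo> & groups)
{
    // Adds one row group's row count to the running total.
    auto add_rows = [](std::size_t acc, const RowGroupInfo & group) { return acc + group.num_rows; };
#if defined(__cpp_lib_ranges_fold)
    // C++23: left fold over the whole range, starting from 0.
    return std::ranges::fold_left(groups, std::size_t{0}, add_rows);
#else
    // Pre-C++23 fallback with the same left-to-right semantics.
    return std::accumulate(groups.begin(), groups.end(), std::size_t{0}, add_rows);
#endif
}
```

Compared with a hand-written loop, the fold states the initial value and the combining step in one expression, which is presumably the readability gain the change is after.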
   
   #### New Functionality
   Added code for benchmarking Iceberg read operations with equality deletes and position deletes. Sample benchmark output:
   ```
   Run on (24 X 1285.69 MHz CPU s)
   CPU Caches:
     L1 Data 48 KiB (x12)
     L1 Instruction 32 KiB (x12)
     L2 Unified 1280 KiB (x12)
     L3 Unified 30720 KiB (x1)
   Load Average: 2.34, 1.91, 2.30
   
   ----------------------------------------------------------------------------------------------
   Benchmark                                                    Time             CPU   Iterations
   ----------------------------------------------------------------------------------------------
   BM_IcebergReadWithPositionDeletes/0/iterations:10         39.3 ms         39.3 ms           10
   BM_IcebergReadWithPositionDeletes/1/iterations:10          156 ms          155 ms           10
   BM_IcebergReadWithPositionDeletes/10/iterations:10         148 ms          145 ms           10
   BM_IcebergReadWithPositionDeletes/50/iterations:10         273 ms          267 ms           10
   BM_IcebergReadWithPositionDeletes/100/iterations:10        347 ms          335 ms           10
   BM_IcebergReadWithEqualityDeletes/0/iterations:10         36.8 ms         36.8 ms           10
   BM_IcebergReadWithEqualityDeletes/1/iterations:10          192 ms          191 ms           10
   BM_IcebergReadWithEqualityDeletes/5/iterations:10          617 ms          616 ms           10
   BM_IcebergReadWithEqualityDeletes/10/iterations:10        1821 ms         1817 ms           10
   BM_IcebergReadWithEqualityDeletes/50/iterations:10        9716 ms         9703 ms           10
   ```
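
To make the numbers above easier to compare, the following sketch computes each run's wall-clock time relative to the argument-0 run of the same benchmark. The timings are copied from the "Time" column. `Sample`, `slowdownVsBaseline`, and the two constant tables are names invented for this example. The meaning of the benchmark argument (the number after the first slash) is not stated in this description, so the code treats it as an opaque label.

```cpp
#include <vector>

// One benchmark row: the benchmark argument and the reported wall time in ms.
struct Sample
{
    int arg;
    double time_ms;
};

// Timings copied from the "Time" column of the benchmark output.
const std::vector<Sample> kPositionDeletes = {{0, 39.3}, {1, 156.0}, {10, 148.0}, {50, 273.0}, {100, 347.0}};
const std::vector<Sample> kEqualityDeletes = {{0, 36.8}, {1, 192.0}, {5, 617.0}, {10, 1821.0}, {50, 9716.0}};

// Returns time_ms / baseline for every sample, where the baseline is the
// first sample (argument 0). Returns an empty vector if there is no usable baseline.
std::vector<double> slowdownVsBaseline(const std::vector<Sample> & samples)
{
    std::vector<double> ratios;
    if (samples.empty() || samples.front().time_ms <= 0.0)
        return ratios;

    const double baseline = samples.front().time_ms;
    ratios.reserve(samples.size());
    for (const Sample & sample : samples)
        ratios.push_back(sample.time_ms / baseline);
    return ratios;
}
```

With these inputs, the largest position-delete run (argument 100) takes roughly 8.8 times as long as its baseline, while the largest equality-delete run (argument 50) takes roughly 264 times as long as its baseline.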
   
   
   ## How was this patch tested?
   Existing unit tests (UTs).
   
   

