[jira] [Updated] (HUDI-9767) Upstream Trino Improvements to Hudi-Trino

Voon Hou (Jira) Wed, 27 Aug 2025 04:04:26 -0700


     [ 
https://issues.apache.org/jira/browse/HUDI-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Voon Hou updated HUDI-9767:
---------------------------
    Description: 
Upstreaming a bunch of Trino improvements to Hudi-Trino and they are:

 
 # Increase Default MaxOutstandingSplits and SplitLoaderParallelism
 # HUDI-9525 Extend file system cache support to Hudi connector
 # Fix split generation parallelism for a non-partitioned table
 # -[MINOR] Disable PR labeler- (Skipped as this is modifies a file in 
.github/workflows)
 # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for 
testing purposes)
 # HUDI-9577 Make target result size configurable from endpoint and server
 # -Create pipeline to build arm image- (Skipped as this modifies github 
actions)
 # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is 
already upstreamed)
 # [MINOR] Added optimizations for HudiColumnStatsIndexSupport
 # [MINOR] Cleanup HudiSplitFactory to extend cache support
 # Fix flakiness when testing cache correctness
 # [Trino] Enable Metadata Table by default
 # [Trino] Fix flaky tests due to table stats computation lagging behind query 
execution
 # Implement Metadata table based Partition listing
 # Fix Case Sensitivity Issues Between Table and Catalog Schemas
 # [Trino] Workers should use latest commit time from table handle
 # Incorrect query results for Merge-On-Read (RT) tables when column stats are 
enabled

 

Upstream starts at commit hash (inclusive): 
ea7f22d0371173a31be0c693a24fa00b7374fe0f

Upstream ends at commit hash (inclusive): 
5e3ebd5b4edd0623e041109cc769f0123bcbd4a7

  was:
Upstreaming a bunch of Trino improvements to Hudi-Trino and they are:

 
 # Increase Default MaxOutstandingSplits and SplitLoaderParallelism
 # HUDI-9525 Extend file system cache support to Hudi connector
 # Fix split generation parallelism for a non-partitioned table
 # [MINOR] Disable PR labeler
 # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for 
testing purposes)
 # HUDI-9577 Make target result size configurable from endpoint and server
 # -Create pipeline to build arm image- (Skipped as this modifies github 
actions)
 # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is 
already upstreamed)
 # [MINOR] Added optimizations for HudiColumnStatsIndexSupport
 # [MINOR] Cleanup HudiSplitFactory to extend cache support
 # Fix flakiness when testing cache correctness
 # [Trino] Enable Metadata Table by default
 # [Trino] Fix flaky tests due to table stats computation lagging behind query 
execution
 # Implement Metadata table based Partition listing
 # Fix Case Sensitivity Issues Between Table and Catalog Schemas
 # [Trino] Workers should use latest commit time from table handle
 # Incorrect query results for Merge-On-Read (RT) tables when column stats are 
enabled

 

Upstream starts at commit hash (inclusive): 
ea7f22d0371173a31be0c693a24fa00b7374fe0f

Upstream ends at commit hash (inclusive): 
5e3ebd5b4edd0623e041109cc769f0123bcbd4a7


> Upstream Trino Improvements to Hudi-Trino
> -----------------------------------------
>
>                 Key: HUDI-9767
>                 URL: https://issues.apache.org/jira/browse/HUDI-9767
>             Project: Apache Hudi
>          Issue Type: Task
>            Reporter: Voon Hou
>            Assignee: Voon Hou
>            Priority: Major
>
> Upstreaming a bunch of Trino improvements to Hudi-Trino and they are:
>  
>  # Increase Default MaxOutstandingSplits and SplitLoaderParallelism
>  # HUDI-9525 Extend file system cache support to Hudi connector
>  # Fix split generation parallelism for a non-partitioned table
>  # -[MINOR] Disable PR labeler- (Skipped as this is modifies a file in 
> .github/workflows)
>  # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for 
> testing purposes)
>  # HUDI-9577 Make target result size configurable from endpoint and server
>  # -Create pipeline to build arm image- (Skipped as this modifies github 
> actions)
>  # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is 
> already upstreamed)
>  # [MINOR] Added optimizations for HudiColumnStatsIndexSupport
>  # [MINOR] Cleanup HudiSplitFactory to extend cache support
>  # Fix flakiness when testing cache correctness
>  # [Trino] Enable Metadata Table by default
>  # [Trino] Fix flaky tests due to table stats computation lagging behind 
> query execution
>  # Implement Metadata table based Partition listing
>  # Fix Case Sensitivity Issues Between Table and Catalog Schemas
>  # [Trino] Workers should use latest commit time from table handle
>  # Incorrect query results for Merge-On-Read (RT) tables when column stats 
> are enabled
>  
> Upstream starts at commit hash (inclusive): 
> ea7f22d0371173a31be0c693a24fa00b7374fe0f
> Upstream ends at commit hash (inclusive): 
> 5e3ebd5b4edd0623e041109cc769f0123bcbd4a7



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (HUDI-9767) Upstream Trino Improvements to Hudi-Trino

Reply via email to