dramaticlly opened a new issue, #7189:
URL: https://github.com/apache/iceberg/issues/7189

   ### Feature Request / Improvement
   
   Based on @szehon-ho comment in #6661 when we trying to add delele file stats 
in partition table. 
   
   https://github.com/apache/iceberg/pull/6661#discussion_r1132981630
   
   Today, the partition table are using one of its kind 
`ManifestGroup.planFiles() / FileScanTask` to read list of data files and 
aggregagte the partition level stats such as record count and files count. 
   
   Szehon proposed to refactor this into ManifestReader to plan for manifests 
read and subsequently data files read, this enable us to read both data 
manifests and delete manifests into a coherent way and avoid keeping large 
hashset 
   - For Data files: we can use `ManifestReader.read()`
   - For delete files: we can use `ManifestReader.readDeleteManifest()` 
   
   
   
   ### Query engine
   
   None


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to