[ https://issues.apache.org/jira/browse/IMPALA-10254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tamas Mate closed IMPALA-10254.
-------------------------------
    Resolution: Fixed

> Load data files via Iceberg for Iceberg Tables
> ----------------------------------------------
>
>                 Key: IMPALA-10254
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10254
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Catalog
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Tamas Mate
>            Priority: Major
>              Labels: impala-iceberg
>
> Currently we still load the file descriptors of an Iceberg table via
> recursive file listing.
> This lists too many files, e.g. metadata files, files that are being written
> (can later throw checksum errors), files from aborted INSERTs, removed files,
> etc.
> We should use the Iceberg API to load the file descriptors corresponding to
> the table snapshot. Iceberg DataFiles might also already contain the split
> offsets.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
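
The difference between the two loading strategies described in the issue can be illustrated with a small toy model. This is only a sketch under stated assumptions: it uses neither Impala nor the Iceberg API, the JSON "snapshot" file is a simplified stand-in for Iceberg's real snapshot and manifest metadata, and every file and function name below (`build_toy_table`, `list_recursively`, `list_from_snapshot`, `snapshot.json`) is hypothetical.

```python
import json
import os
import tempfile


def build_toy_table(table_root):
    """Create a toy table directory (hypothetical layout, not Iceberg's real one)."""
    files = [
        "data/part-0.parquet",              # live data file
        "data/part-1.parquet",              # live data file
        "data/part-aborted.parquet",        # leftover from an aborted INSERT
        "data/.part-2.parquet.inprogress",  # file still being written
        "metadata/v1.metadata.json",        # metadata file, not table data
    ]
    for rel in files:
        full = os.path.join(table_root, rel)
        os.makedirs(os.path.dirname(full), exist_ok=True)
        with open(full, "w") as f:
            f.write("x")

    # Assumption: a plain JSON list stands in for the snapshot's data files.
    snapshot = {
        "data_files": [
            {"path": "data/part-0.parquet", "split_offsets": [4]},
            {"path": "data/part-1.parquet", "split_offsets": [4]},
        ]
    }
    with open(os.path.join(table_root, "metadata", "snapshot.json"), "w") as f:
        json.dump(snapshot, f)


def list_recursively(table_root):
    """Recursive listing: returns every file found under the table root."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(table_root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            found.append(os.path.relpath(full, table_root).replace(os.sep, "/"))
    return sorted(found)


def list_from_snapshot(table_root, snapshot_file="metadata/snapshot.json"):
    """Snapshot-driven listing: returns only files the snapshot references."""
    with open(os.path.join(table_root, snapshot_file)) as f:
        snapshot = json.load(f)
    return sorted(entry["path"] for entry in snapshot["data_files"])


def demo():
    """Build the toy table in a temp dir and return both listings."""
    with tempfile.TemporaryDirectory() as root:
        build_toy_table(root)
        return list_recursively(root), list_from_snapshot(root)


if __name__ == "__main__":
    listed, live = demo()
    print("recursive listing:", listed)
    print("snapshot listing: ", live)
```

In this toy model the recursive listing also picks up the metadata files, the in-progress file, and the leftover from the aborted INSERT, while the snapshot-driven listing returns only the two referenced data files, along with any split offsets recorded for them.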