javrasya commented on issue #9410: URL: https://github.com/apache/iceberg/issues/9410#issuecomment-1888101057
Hi again @pvary. I managed to run it in debug mode and the JSON which is being deserialized is crammed with delete files (`EQUALITY_DELETES` and `POSITION_DELETES`). It is a gigantic json content because of that. We don't do deletes but those must be due to upserting because it is a compacted table based on a key and we keep overriding the rows with the same key. is there any reason for those to be serialized really? Another question; is it possible to skip it with some sort of configuration that is available right now so that it does not keep deleted files without waiting for you guys to fix it in a way? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org