szehon-ho opened a new issue #2481: URL: https://github.com/apache/iceberg/issues/2481
Say we have a use case to experiment on the side with a certain table snapshot (do some modifications, do some checks, etc), but you don't want to alter the table's history. I think 'snapshot' command will be very useful here. We can generate quickly a separate table metadata pointing to the snapshot, instead of copying all the data into a side table for the experiments. It can in the beginning make a soft copy of the current Iceberg table's snapshot, maybe it can take a snapshotid as an argument. Was chatting with @RussellSpitzer and the only negative we think is that if you expire original table's snapshot and remove orphan files than this table will not be able to be read. But it is the same problem as snapshotting a Hive table (dropping some files on original table will corrupt the new table). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
