szehon-ho opened a new issue #2481:
URL: https://github.com/apache/iceberg/issues/2481


   Say we have a use case to experiment on the side with a certain table 
snapshot (do some modifications, do some checks, etc), but you don't want to 
alter the table's history.
   
   I think 'snapshot' command will be very useful here.  We can generate 
quickly a separate table metadata pointing to the snapshot, instead of copying 
all the data into a side table for the experiments.
   
   It can in the beginning make a soft copy of the current Iceberg table's 
snapshot, maybe it can take a snapshotid as an argument.
   
   Was chatting with @RussellSpitzer  and the only negative we think is that if 
you expire original table's snapshot and remove orphan files than this table 
will not be able to be read.  But it is the same problem as snapshotting a Hive 
table (dropping some files on original table will corrupt the new table).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to