rdblue commented on PR #14502:
URL: https://github.com/apache/iceberg/pull/14502#issuecomment-3762244527

   I don't know that I think this is a good idea. I think that the primary 
problem is that the snapshot summary may persist partition information that 
could be sensitive. To me, the right solution is to stop embedding partition 
information in the snapshot summary and instead capture that data (if it is 
needed) using the metrics reporting framework and REST endpoint. That solution 
to getting partition metrics keeps partition info out of the snapshot summaries 
and tracks it through a separate path where it can be transient or protected 
differently.
   
   If the primary reason for introducing this is to stop leaking partition 
summary information in snapshots, then I'd recommend solving that problem more 
directly with something like a catalog override that suppresses them. Or just 
drop them at the catalog level when processing `AddSnapshot` changes.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to