Re: [PR] Spark: Rewrite V2 deletes to V3 DVs [iceberg]

via GitHub Thu, 27 Feb 2025 00:00:31 -0800


nastra commented on code in PR #12250:
URL: https://github.com/apache/iceberg/pull/12250#discussion_r1973052621



##########
spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkPositionDeletesRewrite.java:
##########
@@ -200,7 +207,8 @@ static class PositionDeletesWriterFactory implements 
DataWriterFactory {
         StructType dsSchema,
         int specId,
         StructLike partition,
-        Map<String, String> writeProperties) {
+        Map<String, String> writeProperties,
+        int formatVersion) {

Review Comment:
   we already have the format version in `SerializableTable` but in 
https://github.com/apache/iceberg/pull/11620 we concluded that we don't want to 
support fetching a format version from a `SerializableMetadataTable`. 
   
   There was also some discussion in 
https://github.com/apache/iceberg/pull/11620#discussion_r1852542188 where we 
basically concluded that it's probably better to let the calling site fetch the 
underlying table from `PositionDeletesTable` instead of only supporting the 
retrieval of a format version from the `PositionDeletesTable` and not all the 
other metadata tables in `TableUtil`.
   
   Also the reason why we're passing the `formatVersion` here is so that we 
avoid doing a distributed table metadata load on worker nodes.
   
   Does that make sense or do you think we should still make some changes to 
properly support fetching the format version from the `PositionDeletesTable` in 
`TableUtil`/`SerializableMetadataTable`?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] Spark: Rewrite V2 deletes to V3 DVs [iceberg]

Reply via email to