danielcweeks opened a new pull request, #14979:
URL: https://github.com/apache/iceberg/pull/14979

   Kafka Connect events for written data currently include the name of the 
target table but not its UUID.  Since the coordinator loads the table by name 
at commit time and only then processes the events, a table that was dropped or 
moved and then recreated can cause a number of issues, including: 
   
   - mismatched data file schema (field ids/types don't match the new table)
   - path mismatches where referenced data files are outside of the new table 
location
   - other potential metadata mismatches like referenced data files that no 
longer exist or invalid row ids
   
   This PR validates that the UUID of the table used when constructing the 
writer is consistent with the target table UUID at commit time.
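
   As a rough illustration of the check described above, the sketch below 
compares the UUID captured when the writer was constructed with the UUID of the 
table loaded at commit time.  The class, field, and method names are 
hypothetical stand-ins and are not taken from the Iceberg or Kafka Connect 
code; how the PR reacts to a mismatch is not stated here, so the sketch only 
reports whether the two UUIDs agree.

```java
import java.util.UUID;

// Hypothetical stand-in types for illustration; not the actual Iceberg classes.
final class UuidCheckSketch {

  // A "data written" event that carries the table UUID alongside the table name.
  static final class DataWrittenEvent {
    final String tableName;
    final UUID tableUuid;      // UUID of the table seen when the writer was constructed
    final String dataFilePath; // illustrative path of a data file referenced by the event

    DataWrittenEvent(String tableName, UUID tableUuid, String dataFilePath) {
      this.tableName = tableName;
      this.tableUuid = tableUuid;
      this.dataFilePath = dataFilePath;
    }
  }

  // True when the event was produced against the same table instance that the
  // coordinator loaded at commit time. A null event UUID compares as not equal
  // here; that is an assumption of this sketch, not behavior taken from the PR.
  static boolean isConsistent(DataWrittenEvent event, UUID commitTimeTableUuid) {
    return commitTimeTableUuid.equals(event.tableUuid);
  }

  public static void main(String[] args) {
    UUID original = UUID.randomUUID();  // table at writer construction
    UUID recreated = UUID.randomUUID(); // same name, dropped and recreated

    DataWrittenEvent event =
        new DataWrittenEvent("db.tbl", original, "data/file-1.parquet");

    System.out.println("same table:      " + isConsistent(event, original));
    System.out.println("recreated table: " + isConsistent(event, recreated));
  }
}
```

   Comparing UUIDs rather than names is what distinguishes a recreated table 
from the original, since both resolve to the same name at commit time.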


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

