hudi-bot opened a new issue, #16440: URL: https://github.com/apache/hudi/issues/16440
The table schema resolver needs to read schema from the data files (base or log files) to see whether _hoodie_operation field is present for Flink CDC use cases. This can cause overhead of reading data file footers multiple times. We should see if we can store a table config to indicate if _hoodie_operation field is present in the table, or simplify the Flink CDC format in Hudi 1.0 (thus no need of _hoodie_operation field and schema resolver). ## JIRA info - Link: https://issues.apache.org/jira/browse/HUDI-7585 - Type: Improvement - Epic: https://issues.apache.org/jira/browse/HUDI-6242 - Fix version(s): - 1.1.0 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
