codope commented on pull request #3830:
URL: https://github.com/apache/hudi/pull/3830#issuecomment-951772398


   > trying to understand why do we need this change? metadata table is 
something internally managed. I prefer to enable schema validation always.
   
   I would prefer to validate the schema as well. This change is just to avoid 
the flakiness in 
[TestHoodieDeltaStreamer.testAsyncClusteringServiceWithCompaction](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_apis/build/builds/2725/logs/61).
 By default the schema validation is turned off. It was hardcoded to true for 
metadata table.
   
   However, on a high level, my thinking is even though the metadata table is 
internally managed shouldn't we have the data table configs govern the metadata 
table configs as well instead of hardcoding values? This may not be true for 
all configs but for some we can inherit the configs directly from the data 
table. For instance, let's take the scehma validation config. For whatever 
reason (efficiency, or avoid validation failure), user wants it to be turned 
off doesn't it make sense to turn it off for the metadata table as well for 
those same reasons? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to