xicm commented on PR #7226: URL: https://github.com/apache/hudi/pull/7226#issuecomment-1321470483
> Considering this scenario, I will set `COPY_ON_WRITE_RECORD_SIZE_ESTIMATE` is more smaller than the original to prevent generate a large number of small files when I first load data. But I can't accurately estimate the size of each record after load in hudi, only use the original data meta. And next time, if I hava history data in hudi table, I tend to use the existing data to calculate the record size. And if the schema is evolution, the user provide may be will inaccurate, they need to reset the conf, right? @KnightChess Yes, you are right. The original logic has a problem that user provided avg value does not take effect. @xushiyan could you explain more about the original purpose of this issue? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org