xicm commented on PR #7226:
URL: https://github.com/apache/hudi/pull/7226#issuecomment-1321470483

   > Considering this scenario, I will set `COPY_ON_WRITE_RECORD_SIZE_ESTIMATE` 
is more smaller than the original to prevent generate a large number of small 
files when I first load data. But I can't accurately estimate the size of each 
record after load in hudi, only use the original data meta. And next time, if I 
hava history data in hudi table, I tend to use the existing data to calculate 
the record size. And if the schema is evolution, the user provide may be will 
inaccurate, they need to reset the conf, right?
   
   @KnightChess Yes, you are right. 
   The original logic has a problem that user provided avg value does not take 
effect.
   @xushiyan could you explain more about the original purpose of this issue? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to