hudi-bot opened a new issue, #15492:
URL: https://github.com/apache/hudi/issues/15492

   For estimated avg record size
   
https://hudi.apache.org/docs/configurations/#hoodiecopyonwriterecordsizeestimate
   
   which is used here
   
https://github.com/apache/hudi/blob/86a1efbff1300603a8180111eae117c7f9dbd8a5/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/UpsertPartitioner.java#L372
   
   Propose to respect user setting by following the precedence as below
   
   1) if user sets a value, then use it as is 
   2) if user not setting it, infer from timeline commit metadata 
   3) if timeline is empty, use a default (current: 1024)
   
   
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-5018
   - Type: Improvement
   
   
   ---
   
   
   ## Comments
   
   07/Nov/22 03:23;xichaomin;Hi [~xushiyan] , I can do this improvement, but I 
have a question, how to tell the avg value is set by user or is default?;;;
   
   ---
   
   15/Nov/22 13:53;xushiyan;[~xichaomin] sure pls go ahead!;;;


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to