hudi-bot opened a new issue, #15492: URL: https://github.com/apache/hudi/issues/15492
For the estimated average record size (https://hudi.apache.org/docs/configurations/#hoodiecopyonwriterecordsizeestimate), which is used here:
https://github.com/apache/hudi/blob/86a1efbff1300603a8180111eae117c7f9dbd8a5/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/UpsertPartitioner.java#L372

Propose to respect the user's setting by following this precedence:

1) if the user sets a value, use it as is
2) if the user does not set it, infer it from the timeline commit metadata
3) if the timeline is empty, use a default (current: 1024)

## JIRA info

- Link: https://issues.apache.org/jira/browse/HUDI-5018
- Type: Improvement

---

## Comments

07/Nov/22 03:23;xichaomin;Hi [~xushiyan], I can do this improvement, but I have a question: how can we tell whether the avg value was set by the user or is the default?;;;

---

15/Nov/22 13:53;xushiyan;[~xichaomin] sure pls go ahead!;;;

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
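
The three-step precedence proposed in the issue could be sketched roughly as follows. This is only an illustration, not Hudi's actual implementation: the class `RecordSizeEstimateResolver`, the `resolve` method, and the `inferredFromTimeline` parameter are hypothetical names, and the sketch assumes the caller still has access to the properties exactly as the user supplied them (before any defaults are filled in), which is one possible answer to the question raised in the comments.

```java
import java.util.OptionalLong;
import java.util.Properties;

// Hypothetical helper, not part of Hudi: illustrates the proposed precedence only.
public class RecordSizeEstimateResolver {
    // Config key documented at the configurations link in the issue.
    public static final String KEY = "hoodie.copyonwrite.record.size.estimate";
    // Current default mentioned in the issue.
    public static final long DEFAULT_ESTIMATE = 1024L;

    /**
     * @param userProps            properties as supplied by the user, with no defaults
     *                             filled in (assumption of this sketch)
     * @param inferredFromTimeline average record size derived from commit metadata by
     *                             some other component; empty if the timeline is empty
     */
    public static long resolve(Properties userProps, OptionalLong inferredFromTimeline) {
        // 1) An explicitly set value wins and is used as is.
        String explicit = userProps.getProperty(KEY);
        if (explicit != null) {
            return Long.parseLong(explicit.trim());
        }
        // 2) Otherwise use the value inferred from the timeline,
        // 3) falling back to the default when nothing could be inferred.
        return inferredFromTimeline.orElse(DEFAULT_ESTIMATE);
    }

    public static void main(String[] args) {
        Properties explicitProps = new Properties();
        explicitProps.setProperty(KEY, "2048");
        Properties emptyProps = new Properties();

        System.out.println(resolve(explicitProps, OptionalLong.of(512))); // 2048
        System.out.println(resolve(emptyProps, OptionalLong.of(512)));    // 512
        System.out.println(resolve(emptyProps, OptionalLong.empty()));    // 1024
    }
}
```

The key design point in this sketch is that "set by the user" is detected by the presence of the key in the user-supplied properties rather than by comparing the value against the default, so a user who explicitly sets 1024 is still treated as having set a value.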
