codope commented on code in PR #8758: URL: https://github.com/apache/hudi/pull/8758#discussion_r1232181133
########## hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java: ########## @@ -1620,4 +1620,72 @@ public static String createIndexInitTimestamp(String timestamp, int offset) { public static String createLogCompactionTimestamp(String timestamp) { return timestamp + LOG_COMPACTION_TIMESTAMP_SUFFIX; } + + /** + * Estimates the file group count to use for a MDT partition. + * + * @param partitionType Type of the partition for which the file group count is to be estimated. + * @param recordCount The number of records expected to be written. + * @param averageRecordSize Average size of each record to be writen. + * @param minFileGroupCount Minimum number of file groups to use. + * @param maxFileGroupCount Maximum number of file groups to use. + * @param growthFactor By what factor are the records (recordCount) expected to grow? + * @param maxFileGroupSizeBytes Maximum size of the file group. + * @return The estimated number of file groups. + */ + public static int estimateFileGroupCount(MetadataPartitionType partitionType, long recordCount, int averageRecordSize, int minFileGroupCount, Review Comment: in general, it would be good to have UTs for all public util methods. Especially, this one is a critical method for RI. Let's add a UT. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org