[ https://issues.apache.org/jira/browse/HUDI-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HUDI-4613: --------------------------------- Labels: pull-request-available (was: ) > Avoid the use of regular expressions when call hoodieFileGroup#addLogFile > function > ---------------------------------------------------------------------------------- > > Key: HUDI-4613 > URL: https://issues.apache.org/jira/browse/HUDI-4613 > Project: Apache Hudi > Issue Type: Improvement > Components: core > Reporter: lei w > Priority: Major > Labels: pull-request-available > > When the number of logFile files exceeds a certain amount of data, the > construction of fsview will become very time-consuming. The reason is that > the LogFileComparator#compare method is frequently called when constructing a > filegroup, and regular expressions are used in this method. > {panel:title=build FileSystemView Log } > INFO view.AbstractTableFileSystemView: #files found in partition (20220810) > =60801, Time taken =28314 > INFO view.AbstractTableFileSystemView: addFilesToView: NumFiles=60801, > NumFileGroups=200, FileGroupsCreationTime=34036, StoreTimeTaken=2 > {panel} -- This message was sent by Atlassian Jira (v8.20.10#820010)