lei w created HUDI-4613:
---------------------------

             Summary: Avoid the use of regular expressions when call 
hoodieFileGroup#addLogFile function
                 Key: HUDI-4613
                 URL: https://issues.apache.org/jira/browse/HUDI-4613
             Project: Apache Hudi
          Issue Type: Improvement
          Components: compaction
            Reporter: lei w


When the number of logFile files exceeds a certain amount of data, the 
construction of fsview will become very time-consuming. The reason is that the 
LogFileComparator#compare method is frequently called when constructing a 
filegroup, and regular expressions are used in this method.

{panel:title=build FileSystemView Log }
 INFO view.AbstractTableFileSystemView: #files found in partition (20220810) 
=60801, Time taken =28314
 INFO view.AbstractTableFileSystemView: addFilesToView: NumFiles=60801, 
NumFileGroups=200, FileGroupsCreationTime=34036, StoreTimeTaken=2
{panel}




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to