Bryan Beaudreault created HBASE-28538:
-----------------------------------------

             Summary: BackupHFileCleaner.loadHFileRefs is very expensive
                 Key: HBASE-28538
                 URL: https://issues.apache.org/jira/browse/HBASE-28538
             Project: HBase
          Issue Type: Bug
          Components: backup&restore
            Reporter: Bryan Beaudreault


I noticed some odd CPU spikes on the hmasters of one of our clusters. Turns out 
it had been getting lots of bulkoads (30k) and processing them was expensive. 
The method scans hbase and then parses the paths. Surprisingly the parsing is 
more expensive than the reading hbase, with the vast majority of time spent in 
org/apache/hadoop/fs/Path.<init>.

We should see if this is possible to be optimized. Attaching profile.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to