Bryan Beaudreault created HBASE-28538:
-----------------------------------------
Summary: BackupHFileCleaner.loadHFileRefs is very expensive
Key: HBASE-28538
URL: https://issues.apache.org/jira/browse/HBASE-28538
Project: HBase
Issue Type: Bug
Components: backup&restore
Reporter: Bryan Beaudreault
I noticed some odd CPU spikes on the hmasters of one of our clusters. Turns out
it had been getting lots of bulkoads (30k) and processing them was expensive.
The method scans hbase and then parses the paths. Surprisingly the parsing is
more expensive than the reading hbase, with the vast majority of time spent in
org/apache/hadoop/fs/Path.<init>.
We should see if this is possible to be optimized. Attaching profile.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)