Hello Dan Burkert, Adar Dembo,

I'd like you to do a code review. Please visit

    http://gerrit.cloudera.org:8080/8001

to review the following change.

Change subject: block_id: use a better hash function
......................................................................

block_id: use a better hash function

In looking at LBM startup time I noticed that the insertion into the
block hashmap was taking substantially longer than advertised in various
benchmarks. That led me to think that our block ID patterns were causing
high collision rates.

Swapping out the hash function for block IDs reduced the startup time
substantially.

Tested on a host with ~11M blocks across 14 drives using 'fs check'.

Before:
I0907 13:01:46.997755 21274 fs_manager.cc:335] Time spent opening block manager: real 114.501s user 0.000s sys 0.002s

After:
I0907 12:58:42.863929 20320 fs_manager.cc:335] Time spent opening block manager: real 69.951s user 0.001s sys 0.001s

Change-Id: I30717955f962957d109a6403b55d59ab6c446a87
---
M src/kudu/fs/block_id.h
1 file changed, 7 insertions(+), 1 deletion(-)

  git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/01/8001/1
--
To view, visit http://gerrit.cloudera.org:8080/8001
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: I30717955f962957d109a6403b55d59ab6c446a87
Gerrit-PatchSet: 1
Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-Owner: Todd Lipcon <t...@apache.org>
Gerrit-Reviewer: Adar Dembo <a...@cloudera.com>
Gerrit-Reviewer: Dan Burkert <danburk...@apache.org>
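
For readers who want to see the collision effect described in the commit message, here is a minimal sketch. It is not the code from this change: the message does not say which hash function was replaced or which one replaced it, nor what the block ID pattern looks like. The names IdentityHash, MixHash, and DistinctBuckets are hypothetical. The sketch assumes a table with a power-of-two bucket count and IDs spaced a fixed stride apart, and it uses the MurmurHash3 64-bit finalizer as a stand-in for "a better hash function".

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical "weak" hash: returns the ID unchanged. Some standard
// library implementations hash integer keys this way.
inline uint64_t IdentityHash(uint64_t id) { return id; }

// MurmurHash3 64-bit finalizer (fmix64). Used here only as an example
// of a mixing hash; the actual change may use something else.
inline uint64_t MixHash(uint64_t k) {
  k ^= k >> 33;
  k *= 0xff51afd7ed558ccdULL;
  k ^= k >> 33;
  k *= 0xc4ceb9fe1a85ec53ULL;
  k ^= k >> 33;
  return k;
}

// Counts how many distinct buckets are touched when `n` IDs spaced
// `stride` apart are hashed into `num_buckets` buckets.
// `num_buckets` must be a power of two (an assumption of this sketch).
template <typename HashFn>
size_t DistinctBuckets(HashFn hash, uint64_t n, uint64_t stride,
                       size_t num_buckets) {
  std::vector<bool> used(num_buckets, false);
  size_t distinct = 0;
  for (uint64_t i = 0; i < n; ++i) {
    // Bucket index is taken from the low bits of the hash value.
    size_t bucket = hash(i * stride) & (num_buckets - 1);
    if (!used[bucket]) {
      used[bucket] = true;
      ++distinct;
    }
  }
  return distinct;
}
```

With 1000 IDs that are all multiples of 1024 and a 1024-bucket table, DistinctBuckets(IdentityHash, 1000, 1024, 1024) puts every ID in a single bucket, because the low ten bits of each ID are zero. The same call with MixHash spreads the IDs over many buckets. Whether real block IDs show a similar pattern is exactly the hypothesis the commit message raises; the timings above are the only evidence given.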