Thomas Mueller created OAK-11781:
------------------------------------

             Summary: Binary reference statistics are inaccurate for very large repositories
                 Key: OAK-11781
                 URL: https://issues.apache.org/jira/browse/OAK-11781
             Project: Jackrabbit Oak
          Issue Type: Improvement
            Reporter: Thomas Mueller


The DistinctBinarySize report becomes inaccurate once a repository has more 
than around 16 million binary references. The Bloom filter size is currently 
set to 16 MB, which is not enough for some repositories and leads to a very 
high false-positive rate of around 95% (the normal rate is about 1%).

It is quite easy to increase the memory size for the Bloom filter.
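
To illustrate why a fixed 16 MB filter degrades, here is a rough sketch that 
uses the standard Bloom filter approximation p = (1 - e^(-k*n/m))^k. It is not 
the Oak implementation: the class name, the method names, and the number of 
hash functions (k = 7) are assumptions made for this example only, so the 
printed rates will not necessarily match what the report shows.

```java
public class BloomFilterEstimate {

    // Approximate false-positive rate for a filter with `bits` bits,
    // `entries` inserted elements and `k` hash functions.
    static double falsePositiveRate(long bits, long entries, int k) {
        return Math.pow(1.0 - Math.exp(-(double) k * entries / bits), k);
    }

    // Bits needed to hold `entries` elements at the target false-positive
    // rate, assuming an optimal number of hash functions:
    // m = -n * ln(p) / (ln 2)^2
    static long bitsFor(long entries, double targetFpp) {
        double ln2 = Math.log(2);
        return (long) Math.ceil(-entries * Math.log(targetFpp) / (ln2 * ln2));
    }

    public static void main(String[] args) {
        long bits = 16L * 1024 * 1024 * 8; // 16 MB expressed in bits
        int k = 7;                         // assumed, not taken from Oak
        long[] counts = {10_000_000L, 16_000_000L, 100_000_000L, 500_000_000L};
        for (long n : counts) {
            double fpp = falsePositiveRate(bits, n, k);
            long mbNeeded = bitsFor(n, 0.01) / 8 / (1024 * 1024);
            System.out.printf(
                "%d entries: estimated fpp=%.4f, about %d MB needed for 1%%%n",
                n, fpp, mbNeeded);
        }
    }
}
```

Under these assumptions, holding 16 million entries at a 1% rate already needs 
slightly more than 16 MB, and the required size grows linearly with the number 
of binary references.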


