[ https://issues.apache.org/jira/browse/OAK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Mueller updated OAK-2466:
--------------------------------
    Fix Version/s:     (was: 1.4)

> DataStoreBlobStore: chunk ids should not contain the size
> ---------------------------------------------------------
>
>                 Key: OAK-2466
>                 URL: https://issues.apache.org/jira/browse/OAK-2466
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: core
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>              Labels: datastore, performance
>
> The blob store garbage collection (data store garbage collection) uses the
> chunk ids to identify binaries to be deleted. The blob ids contain the size
> now (<contentHash>#<size>), and the blob id is currently equal to the chunk
> id.
> It would be more efficient to _not_ use the size, and instead just use the
> content hash, for the chunk ids. That way, enumerating the entries that are
> in the store is potentially faster. Also, it allows us to change the blob id
> in the future, for example add more information to it (for example the
> creation time, or the first few bytes of the content) if we ever want to.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
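
To illustrate the proposal in the issue description, here is a minimal sketch of deriving a chunk id from a blob id of the form <contentHash>#<size> by dropping the size suffix. The class ChunkIds and its methods are hypothetical names chosen for this example; they are not part of the Oak API, and the actual change in DataStoreBlobStore may look different.

```java
// Hypothetical helper, for illustration only; not part of Jackrabbit Oak.
final class ChunkIds {

    private ChunkIds() {
    }

    /**
     * Returns the content-hash part of a blob id of the form
     * <contentHash>#<size>. If the id has no '#' separator, it is
     * returned unchanged (assumed to already be a plain content hash).
     */
    public static String chunkIdFromBlobId(String blobId) {
        int sep = blobId.lastIndexOf('#');
        return sep < 0 ? blobId : blobId.substring(0, sep);
    }

    /**
     * Returns the size encoded in the blob id, or -1 if the id
     * carries no size suffix.
     */
    public static long sizeFromBlobId(String blobId) {
        int sep = blobId.lastIndexOf('#');
        return sep < 0 ? -1L : Long.parseLong(blobId.substring(sep + 1));
    }
}
```

With this split, garbage collection could compare plain content hashes, while the blob id would remain free to carry extra information after the separator.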