[ 
https://issues.apache.org/jira/browse/OAK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Mueller updated OAK-2466:
--------------------------------
    Fix Version/s:     (was: 1.4)

> DataStoreBlobStore: chunk ids should not contain the size
> ---------------------------------------------------------
>
>                 Key: OAK-2466
>                 URL: https://issues.apache.org/jira/browse/OAK-2466
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: core
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>              Labels: datastore, performance
>
> The blob store garbage collection (data store garbage collection) uses the 
> chunk ids to identify binaries to be deleted. The blob ids contain the size 
> now (<contentHash>#<size>), and the blob id is currently equal to the chunk 
> id.
> It would be more efficient to _not_ use the size, and instead just use the 
> content hash, for the chunk ids. That way, enumerating the entries that are 
> in the store is potentially faster. Also, it allows us to change the blob id 
> in the future, for example add more information to it (for example the 
> creation time, or the first few bytes of the content) if we ever want to.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to