Hi,

Thomas, your patch references 
org.apache.jackrabbit.benchmark.RandomInputStream, wich is not included in the 
patch, and I couldn't find it anywhere.

You are right, I saw that as well. This class is used for testing, you
can ignore it (the test doesn't do anything useful anyway). The next
patch will contain all classes.

Thomas

Regards

> Global data store for binaries
> ------------------------------
>
>                 Key: JCR-926
>                 URL: https://issues.apache.org/jira/browse/JCR-926
>             Project: Jackrabbit
>          Issue Type: New Feature
>          Components: core
>            Reporter: Jukka Zitting
>         Attachments: dataStore.patch, DataStore.patch, DataStore2.patch, 
dataStore3.patch, dataStore4.zip, internalValue.patch, ReadWhileSaveTest.patch
>
>
> There are three main problems with the way Jackrabbit currently handles large 
binary values:
> 1) Persisting a large binary value blocks access to the persistence layer for 
extended amounts of time (see JCR-314)
> 2) At least two copies of binary streams are made when saving them through 
the JCR API: one in the transient space, and one when persisting the value
> 3) Versioining and copy operations on nodes or subtrees that contain large 
binary values can quickly end up consuming excessive amounts of storage space.
> To solve these issues (and to get other nice benefits), I propose that we implement a 
global "data store" concept in the repository. A data store is an append-only set 
of binary values that uses short identifiers to identify and access the stored binary 
values. The data store would trivially fit the requirements of transient space and 
transaction handling due to the append-only nature. An explicit mark-and-sweep garbage 
collection process could be added to avoid concerns about storing garbage values.
> See the recent NGP value record discussion, especially [1], for more 
background on this idea.
> [1] 
http://mail-archives.apache.org/mod_mbox/jackrabbit-dev/200705.mbox/[EMAIL 
PROTECTED]

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Reply via email to