[ 
https://issues.apache.org/jira/browse/OAK-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370894#comment-15370894
 ] 

Alex Parvulescu commented on OAK-4201:
--------------------------------------

Looks good to me +1

> Add an index of binary references in a tar file
> -----------------------------------------------
>
>                 Key: OAK-4201
>                 URL: https://issues.apache.org/jira/browse/OAK-4201
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: segment-tar
>            Reporter: Chetan Mehrotra
>            Assignee: Francesco Mari
>             Fix For: Segment Tar 0.0.4
>
>         Attachments: OAK-4201-01.patch
>
>
> Currently for  Blob GC in case of segment {{SegmentBlobReferenceRetriever}} 
> goes through all tar files and extracts the binary references. This has 2 
> issues
> # Logic has go through all the segments in all tar files
> # All segments get loaded in memory once which would affect normal system 
> performance
> This process can be optimized if we also write a file entry in tar (similar 
> to gph i.e. graph and idx i.e. index files) which has entries of all binary 
> references referred to in any segment present in that tar file. Then GC logic 
> would just have read this file and avoid scanning all the segments



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to