[ 
https://issues.apache.org/jira/browse/COMPRESS-540?focusedWorklogId=528079&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-528079
 ]

ASF GitHub Bot logged work on COMPRESS-540:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 24/Dec/20 11:15
            Start Date: 24/Dec/20 11:15
    Worklog Time Spent: 10m 
      Work Description: theobisproject commented on pull request #113:
URL: https://github.com/apache/commons-compress/pull/113#issuecomment-750851579


   Hey @PeterAlfredLee I can invest some time starting from tomorrow. It would 
be good if you are able to go through the changes once again and comment on the 
TODOs I left in there.
   Regarding the removal of the code duplication in my opinion this would 
require some major changes to the existing `TarArchiveInputStream`. So we 
should decide what we want to do about it in this PR.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 528079)
    Time Spent: 5h 20m  (was: 5h 10m)

> Random access on Tar archive
> ----------------------------
>
>                 Key: COMPRESS-540
>                 URL: https://issues.apache.org/jira/browse/COMPRESS-540
>             Project: Commons Compress
>          Issue Type: Improvement
>            Reporter: Robin Schimpf
>            Priority: Major
>          Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> The TarArchiveInputStream only provides sequential access. If only a small 
> amount of files from the archive is needed large amount of data in the input 
> stream needs to be skipped.
> Therefore I was working on a implementation to provide random access to 
> TarFiles equal to the ZipFile api. The basic idea behind the implementation 
> is the following
>  * Random access is backed by a SeekableByteChannel
>  * Read all headers of the tar file and save the place to the data of every 
> header
>  * User can request an input stream for any entry in the archive multiple 
> times



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to