Tom,

Currently, the AIP Packager is (wrongly & unfortunately) hardcoded to 
use Zip.

I do think this is a great feature request.  Personally, I've not yet 
tried to make this change so that you could either either TAR or ZIP.

But, off the top of my head, here's how you'd do it:

(1) Change the 'writeZipPackage()' method of 
'org.dspace.content.packager.AbstractMETSDisseminator' so that it is 
configurable to use either:
      - java.util.zip.ZipOutputStream (and associated classes)
      - OR, 
org.apache.commons.compress.archivers.tar.TarArchiveOutputStream (and 
associated classes)

(2) Change the 'parsePackage()' method of 
'org.dspace.content.packager.AbstractMETSIngester' so that it is also 
configurable to use either:
       - java.util.zip.ZipInputStream (and associated classes)
       - OR, 
org.apache.commons.compress.archivers.tar.TarArchiveInputStream (and 
associated classes)

Again, unfortunately, I've not yet tried this. But, I have added this 
enhancement request to our JIRA. So, once someone has time to look at 
this, we can get it added to the AIP packager.

https://jira.duraspace.org/browse/DS-1137

--
One sidenote here: in Java 7, Zip64 is now suppported (allows for files 
up to 2^64 bytes), which removes this limitation from the 
java.util.zip.* classes.

https://blogs.oracle.com/xuemingshen/entry/zip64_support_for_4g_zipfile
http://en.wikipedia.org/wiki/Zip_%28file_format%29#ZIP64

- Tim

On 2/29/2012 11:35 AM, Tom Autry wrote:
> With the current size limitations in Java ZIP at 4G and 65K files, is
> there an easy way to change the underlying output to a different format
> such as TAR instead of ZIP to get around this limit as we have files
> much larger that this that are being ingested? Has anyone else run into
> this limitation or problems with very large files? Or possibly export to
> a directory structure instead of compiling into files and then we can
> use a different tool to create a single package?
>
> Thanks.
>
> Tom Autry
>
> Coffing Corporation
>
> 3136 Presidential Drive
>
> Fairborn, Ohio 45324
>
> Office: 937-458-6100
>
> Cell: 937-361-4680
>
> Email: [email protected]
>
>
> ------------------------------------------------------------------------
> This e-mail message and any attachments may contain legally privileged,
> confidential or proprietary information. If you are not the intended
> recipient(s),or the employee or agent responsible for delivery of this
> message to the intended recipient(s), you are hereby notified that any
> dissemination, distribution or copying of this e-mail message is
> strictly prohibited. If you have received this message in error, please
> immediately notify the sender and delete this e-mail message from your
> computer. Any views expressed in this message are those of the
> individual sender and may not necessarily reflect the views of the company.
>
>
> ------------------------------------------------------------------------------
> Virtualization&  Cloud Management Using Capacity Planning
> Cloud computing makes use of virtualization - but cloud computing
> also focuses on allowing computing to be delivered as a service.
> http://www.accelacomm.com/jaw/sfnl/114/51521223/
>
>
>
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech

------------------------------------------------------------------------------
Virtualization & Cloud Management Using Capacity Planning
Cloud computing makes use of virtualization - but cloud computing 
also focuses on allowing computing to be delivered as a service.
http://www.accelacomm.com/jaw/sfnl/114/51521223/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to