If you don't care about compression (reducing storage size of the
Zip/Tar/etc. container) and just want to bundle them together, than you can
use a shell command and execute a tar against the directory where the PDFs
are stored.  Tar doesn't compress by default; don't pass it any extra
compression options.

If you care about compression and want speed while trading off a bit of
compression, then use a shell command and execute using an lzop compressor
(but there are many to choose from like lz4 and zstd, but ensure whichever
native executable you choose supports multi-core cpu processing in order to
be fast, if you have this as a repeatable workflow)

The reason for using a Shell command to process this outside of Hop is so
that you can take advantage of native hardware CPU acceleration (SIMD,
cache, etc.) since the current internal Java implemenations are not highly
optimized to take advantage of native hardware CPU acceleration (but that
could change and be improved with the Java Vector API forthcoming changes
and changes in Hop).

Thad
https://www.linkedin.com/in/thadguidry/
https://calendly.com/thadguidry/


On Fri, Jul 12, 2024 at 6:06 AM Rob Burgess <[email protected]> wrote:

> Hi
>
> I would like to zip approximately 30,000 PDF files based on the date
> they were  modified.
>
> What is the quickest way to do this in Hop?
>
> Thanks
>
> Rob
>

Reply via email to