Hi Kodiak, I've had the discussion over deduplication before (I can't find it though), and I think the agreement was I would run a test & publish the results at some point.
Basically, Pulp already duplicates completely identical content files such as RPMS & debs. VDO would provide dedup at the (4KB) block layer. However, RPMs, debs, etc contain compression like xz within them. And they do solid compression (like "data.tar.xz" with debs), not a per-file compression. Compression interferes with most of the effects of deduplication: http://thestoragealchemist.com/blog/2010/04/comression-deduplication-oil-water-or-milk-cookies Other content types may be uncompressed. I *think* docker images typically are uncompressed. They would certainly benefit. -Mike On Mon, May 13, 2019 at 2:37 PM Kodiak Firesmith <[email protected]> wrote: > Just curious if anyone is using VDO block dedupe since it went into > production support in RHEL 7.5. Playing with it at Summit got me thinking > about use cases, which of course made me think of Pulp. > Thanks, > - Kodiak > _______________________________________________ > Pulp-list mailing list > [email protected] > https://www.redhat.com/mailman/listinfo/pulp-list
_______________________________________________ Pulp-list mailing list [email protected] https://www.redhat.com/mailman/listinfo/pulp-list
