On 20/05/2021 13:58, Dave Bond wrote:
As part of a project I am doing I am looking if there are any de
duplication options for GPFS? I see there is no native de dupe for the
filesystem. The scenario would be user A creates a file or folder and
user B takes a copy within the same filesystem, though separate
independent filesets. The intention would be to store 1 copy. So I
was wondering ....
1) Is this is planned to be implemented into GPFS in the future?
2) Is anyone is using any other solutions that have good GPFS integration?
Disk space in 2021 is insanely cheap. With an ESS/DSS you can get many
PB in a single rack. The complexity that dedup introduces is simple not
worth it IMHO.
Or put another way there is better things the developers at IBM can be
working on than dedup code.
Historically if you crunched the numbers the licensing for dedup on
NetApp was similar to just buying more disk unless you where storing
hundreds of copies of the same data. About the only use case scenario
would be storing lots of virtual machines. However I refer you back to
my original point :-)
JAB.
--
Jonathan A. Buzzard Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss