On Thu, Jan 21, 2010 at 2:51 PM, Andrey Kuzmin
<andrey.v.kuz...@gmail.com> wrote:
> Looking at dedupe code, I noticed that on-disk DDT entries are
> compressed less efficiently than possible: key is not compressed at
> all (I'd expect roughly 2:1 compression ration with sha256 data),

A cryptographic hash such as sha256 should not be compressible.  A
trivial example shows this to be the case:

for i in {1..10000} ; do
    echo $i | openssl dgst -sha256 -binary
done > /tmp/sha256

$ gzip -c <sha256 >sha256.gz
$ compress -c <sha256 >sha256.Z
$ bzip2 -c <sha256 >sha256.bz2

$ ls -go sha256*
-rw-r--r--   1  320000 Jan 22 04:13 sha256
-rw-r--r--   1  428411 Jan 22 04:14 sha256.Z
-rw-r--r--   1  321846 Jan 22 04:14 sha256.bz2
-rw-r--r--   1  320068 Jan 22 04:14 sha256.gz

-- 
Mike Gerdts
http://mgerdts.blogspot.com/
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to