Hi,

Out of this thread, Bill Freeman and I got onto a side discussion of how
the compression efficiency of gzip(1) varies with blocksize.  This led to
an experiment: take the first 2MB (2000000 bytes) of /usr/doc/HOWTO/* and
of /bin/*, break each blob up into blocks, gzip each block separately,
and see how well the blob compresses overall, for blocksizes of 128, 256,
512, ... up to the full 2MB.
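
For anyone who wants to repeat this, here is a rough sketch of the
procedure in Python.  It is not the script used for the numbers below;
the function names are made up for illustration, and it assumes that
Python's gzip module at level 6 is a fair stand-in for the default
behaviour of gzip(1).

```python
import gzip


def blockwise_gzip_size(data: bytes, blocksize: int) -> int:
    """Total bytes when each blocksize-sized chunk of data is gzipped on its own."""
    total = 0
    for start in range(0, len(data), blocksize):
        block = data[start:start + blocksize]
        # Level 6 is assumed here to mirror the gzip(1) default.
        total += len(gzip.compress(block, compresslevel=6))
    return total


def report(data: bytes) -> None:
    """Print blocksize, ratio, compressed/uncompressed for doubling blocksizes."""
    # 128, 256, ..., 1048576, limited to sizes smaller than the input,
    # followed by the whole input as a single block.
    sizes = [128 << i for i in range(14) if (128 << i) < len(data)]
    sizes.append(len(data))
    for bs in sizes:
        size = blockwise_gzip_size(data, bs)
        print(f"{bs:8d}  {size / len(data):.4f}  {size:8d}/{len(data)}")
```

Feeding report() the first 2000000 bytes of the concatenated files should
print a table in the same layout as the ones below, though the exact byte
counts will depend on the gzip version and on the files themselves.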

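The results are below, in the form:

  blocksize, compression-ratio, compressed-size/uncompressed-size

All sizes are in bytes.  The ratio is compressed size divided by
uncompressed size, so lower is better.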

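This could be useful background info to keep in the back of one's mind.
Even small blocks of 1-2K get a reasonable fraction of the full
compression: the HOWTO text shrinks to 53-47% of its original size with
1024-2048 byte blocks, versus 31% as a single block, and the binaries to
63-57%, versus 47%.  At 128-byte blocks the binaries actually grow
slightly (ratio 1.0111).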
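
One likely contributor to the poor ratios at the smallest blocksizes is
fixed framing cost: in the gzip file format every member carries at least
a 10-byte header and an 8-byte trailer (CRC32 plus length).  The sketch
below estimates that cost per blocksize.  It assumes no optional header
fields such as a stored filename, which gzip(1) may add depending on how
it is invoked, so treat it as a lower bound and not as an accounting of
the measured numbers.

```python
# Minimum gzip framing per member: 10-byte header + 8-byte trailer.
# Assumes no optional fields (filename, comment, extra) are present.
GZIP_MEMBER_OVERHEAD = 18


def framing_overhead(total_size: int, blocksize: int) -> int:
    """Lower bound on header/trailer bytes when total_size is split into blocks."""
    blocks = -(-total_size // blocksize)  # ceiling division
    return blocks * GZIP_MEMBER_OVERHEAD


for bs in (128, 1024, 2048, 2000000):
    print(f"{bs:8d}  {framing_overhead(2000000, bs):7d}")
```

At 128-byte blocks this comes to 15625 members and at least 281250 bytes
of framing, about 14% of the input, before any data is compressed.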

/usr/doc/HOWTO/*

total size: 2000000

     128  0.9773   1954520/2000000
     256  0.7569   1513703/2000000
     512  0.6234   1246810/2000000
    1024  0.5348   1069536/2000000
    2048  0.4703    940685/2000000
    4096  0.4240    847974/2000000
    8192  0.3896    779129/2000000
   16384  0.3637    727375/2000000
   32768  0.3427    685395/2000000
   65536  0.3283    656661/2000000
  131072  0.3206    641290/2000000
  262144  0.3166    633217/2000000
  524288  0.3146    629162/2000000
 1048576  0.3136    627276/2000000
 2000000  0.3134    626753/2000000

/bin/*

total size: 2000000

     128  1.0111   2022191/2000000
     256  0.8391   1678176/2000000
     512  0.7242   1448394/2000000
    1024  0.6346   1269181/2000000
    2048  0.5748   1149687/2000000
    4096  0.5370   1073970/2000000
    8192  0.5130   1026006/2000000
   16384  0.4961    992103/2000000
   32768  0.4837    967453/2000000
   65536  0.4758    951601/2000000
  131072  0.4730    946040/2000000
  262144  0.4710    942082/2000000
  524288  0.4704    940761/2000000
 1048576  0.4692    938445/2000000
 2000000  0.4693    938545/2000000

