Re: compressing short strings?

2008-05-21 Thread Paul Rubin
"inhahe" <[EMAIL PROTECTED]> writes: > i don't see anybody mentioning huffman encoding. i think it just works per > byte, so it's not as tight as gzip or whatever. but it sounds like it would > be easy to implement and wouldn't require any corpus-wide compression > information. except a charac

Re: compressing short strings?

2008-05-21 Thread inhahe
i don't see anybody mentioning huffman encoding. i think it just works per byte, so it's not as tight as gzip or whatever. but it sounds like it would be easy to implement and wouldn't require any corpus-wide compression information. except a character frequency count if you wanted to be optim

Re: compressing short strings?

2008-05-20 Thread castironpi
On May 20, 8:24 am, [EMAIL PROTECTED] wrote: > bearophile: > > > So you need to store only this 11 byte long string to be able to > > decompress it. > > Note that maybe there is a header, that may contain changing things, > like the length of the compressed text, etc. > > Bye, > bearophile I've re

Re: compressing short strings?

2008-05-20 Thread bearophileHUGS
bearophile: > So you need to store only this 11 byte long string to be able to > decompress it. Note that maybe there is a header, that may contain changing things, like the length of the compressed text, etc. Bye, bearophile -- http://mail.python.org/mailman/listinfo/python-list

Re: compressing short strings?

2008-05-20 Thread bearophileHUGS
Helmut Jarausch: > I'd ask in comp.compression where the specialists are listening and who are > very helpful. Asking in comp.compression is a good starting point. My suggestions (sorry if they look a bit unsorted): it depends on what language you want to use, how much you want to compress the st

Re: compressing short strings?

2008-05-20 Thread Helmut Jarausch
Paul Rubin wrote: I have a lot of short English strings I'd like to compress in order to reduce the size of a database. That is, I'd like a compression function that takes a string like (for example) "George Washington" and returns a shorter string, with luck maybe 6 bytes or so. One obvious id

Re: compressing short strings?

2008-05-20 Thread Thomas Troeger
Paul Rubin wrote: I have a lot of short English strings I'd like to compress in order to reduce the size of a database. That is, I'd like a compression function that takes a string like (for example) "George Washington" [...] Thanks. I think your idea is good, maybe you'd want to build an

Re: compressing short strings?

2008-05-20 Thread Arnaud Delobelle
Paul Rubin writes: > I have a lot of short English strings I'd like to compress in order to > reduce the size of a database. That is, I'd like a compression > function that takes a string like (for example) "George Washington" > and returns a shorter string, with luck m

compressing short strings?

2008-05-20 Thread Paul Rubin
I have a lot of short English strings I'd like to compress in order to reduce the size of a database. That is, I'd like a compression function that takes a string like (for example) "George Washington" and returns a shorter string, with luck maybe 6 bytes or so. One obvious idea is take the gzip