On Fri, Jan 20, 2012 at 11:45 AM, Jarvis, Matthew <mat...@mckweb.com> wrote:
> I guess I just don't get it.... > > If I was consistent w/ my tags being uppercase, why wouldn't it be the > same compression size as if using all lowercase?? If it's found the > value, it's found the value and doesn't have to have another reference > for it.... Oh, come on, it's pretty obvious... Uppercase letters are BIGGER!!! (kidding) There's a good chance that all the lowercase letters are contained in your body content, kind of like in this sentence: there's a bunch of lowercase letters, but only one uppercase one. Some of the ZIP algorithms work by building up a table of all the characters used and what their compressed representation is. If all 26 lowercase letters are used, but only a few uppercase, the table can be shorter, and, if you luck into the right size, the amount of bits used to express all of the characters used can also be smaller, hence a smaller overall size. > Or was Paul not joking like I thought he was when he mentioned the ASCII > values being higher/lower than the other? That was good leg-pulling, I think. We ought to try recoding the page in EBCDIC and see if the different positions of the symbols make a difference in the resulting compression... -- Ted Roche Ted Roche & Associates, LLC http://www.tedroche.com _______________________________________________ Post Messages to: ProFox@leafe.com Subscription Maintenance: http://leafe.com/mailman/listinfo/profox OT-free version of this list: http://leafe.com/mailman/listinfo/profoxtech Searchable Archive: http://leafe.com/archives/search/profox This message: http://leafe.com/archives/byMID/profox/CACW6n4veHBPY7oErLRE=qTR0KKT=ly+msskzqjkzodppqvm...@mail.gmail.com ** All postings, unless explicitly stated otherwise, are the opinions of the author, and do not constitute legal or medical advice. This statement is added to the messages for those lawyers who are too stupid to see the obvious.