On Fri, Jan 20, 2012 at 11:45 AM, Jarvis, Matthew <mat...@mckweb.com> wrote:

> I guess I just don't get it....
>
> If I was consistent w/ my tags being uppercase, why wouldn't it be the
> same compression size as if using all lowercase?? If it's found the
> value, it's found the value and doesn't have to have another reference
> for it....

Oh, come on, it's pretty obvious... Uppercase letters are BIGGER!!!

(kidding)

There's a good chance that all the lowercase letters are contained in
your body content, kind of like in this sentence: there's a bunch of
lowercase letters, but only one uppercase one.

Some of the ZIP algorithms work by building up a table of all the
characters used and what their compressed representation is. If all 26
lowercase letters are used, but only a few uppercase, the table can be
shorter, and, if you luck into the right size, the amount of bits used
to express all of the characters used can also be smaller, hence a
smaller overall size.

> Or was Paul not joking like I thought he was when he mentioned the ASCII
> values being higher/lower than the other?

That was good leg-pulling, I think. We ought to try recoding the page
in EBCDIC and see if the different positions of the symbols make a
difference in the resulting compression...

-- 
Ted Roche
Ted Roche & Associates, LLC
http://www.tedroche.com

_______________________________________________
Post Messages to: ProFox@leafe.com
Subscription Maintenance: http://leafe.com/mailman/listinfo/profox
OT-free version of this list: http://leafe.com/mailman/listinfo/profoxtech
Searchable Archive: http://leafe.com/archives/search/profox
This message: 
http://leafe.com/archives/byMID/profox/CACW6n4veHBPY7oErLRE=qTR0KKT=ly+msskzqjkzodppqvm...@mail.gmail.com
** All postings, unless explicitly stated otherwise, are the opinions of the 
author, and do not constitute legal or medical advice. This statement is added 
to the messages for those lawyers who are too stupid to see the obvious.

Reply via email to