On Nov 22, 2007 1:04 PM, Daniel Önnerby <[EMAIL PROTECTED]> wrote:
> In the future I am using UTF8 encoded databases since the conversion of
> strings is a small thing for the system. The advantages of using UTF8
> are many:
> 1. Faster in most cases
> 2. Smaller databases (30% smaller in benchmark test database)
> 3. Less memory usage OR more information will fit in memory.
Well, of course it comes as no surprise that if your database is primarily US-ASCII text, UTF-8 will be better. Smaller strings mean shorter comparisons and more densely packed b-trees. UTF-16 is only smaller if a lot of your text would be encoded with 3 UTF-8 code units per character (most CJK text, for example); at 2 UTF-8 code units the two encodings come out the same size.

-- Cory Nelson
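
For anyone who wants to check the size trade-off on their own data, here is a rough Python sketch. The helper name `encoded_sizes` and the sample strings are made up for illustration and are not from the benchmark mentioned above. It only counts encoded bytes (no BOM) and says nothing about how a particular database engine stores text.

```python
from typing import Tuple


def encoded_sizes(text: str) -> Tuple[int, int]:
    """Return (utf8_bytes, utf16_bytes) for text, both without a BOM."""
    utf8_len = len(text.encode("utf-8"))
    # "utf-16-le" avoids the 2-byte BOM that plain "utf-16" would prepend.
    utf16_len = len(text.encode("utf-16-le"))
    return utf8_len, utf16_len


# Hypothetical samples, one per UTF-8 code-unit count.
samples = {
    "ascii (1 UTF-8 unit/char)": "hello",
    "greek (2 UTF-8 units/char)": "\u03b1\u03b2\u03b3",
    "cjk (3 UTF-8 units/char)": "\u65e5\u672c\u8a9e",
    "emoji (4 UTF-8 units/char)": "\U0001F600",
}

for label, text in samples.items():
    utf8_len, utf16_len = encoded_sizes(text)
    print(f"{label}: UTF-8 = {utf8_len} bytes, UTF-16 = {utf16_len} bytes")
```

Run over a representative dump of your own text columns, the same two `len()` calls should give a reasonable first guess at which encoding will produce the smaller database.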