Jim Rising wrote: > back in the days prior to Unicode, these symbol language characters would > take 2 bytes to represent. there even used to be a system called 'DBCS' > (Double Byte Character System) where some letters were stored in 1 byte, > others were stored in 2 bytes.
i know what dbcs means & there's never been a single so-called "Double Byte Character System". pre-unicode was almost *20* years ago. japanese had a butt load of encodings that are single, double & multi-byte ("still has" i guess would be correct). ditto for chinese. "ancient" cyrillic (KOI-8) was a single byte as was pretty much any language that could fit on a windows (or DOS) codepage like thai & arabic. in any case, leaving out codepage nonsense, none of these languages have been anything but multi-byte for at least 10 years. the term dbcs is misleading & we should all stop using it. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Enterprise web applications, build robust, secure scalable apps today - Try it now ColdFusion Today ColdFusion 8 beta - Build next generation apps Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:291038 Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4