I like to know how are SMS messages encoded. What encoded is used ? UCS-2 ? UTF-8 ? Possible scenarios: (1) simple ASCII text; (2) characters from ASCII combined with some from ISO-8859-x; (3) all Asian text; (...)
The three encodings that I know have been used for SMS text, at least with Palm OS, are:
a. Raw text - 8 bit undefined, which usually gets treated as ISO 8859-1, but it completely up to the recipient. Or at least that was my understanding.
b. GSM - a 7-bit encoding with two unusual features: it uses 0 (NULL) as a valid code point, and an escape as a way of accessing a secondary table of code points.
c. UCS-2.
Depending on your locale, different approaches are taken to encoding messages. In China I think all messages are sent as UCS-2. In Hong Kong, first GSM is tried, but if that can't represent all of the characters then UCS-2 is the fall-back.
-- Ken -- Ken Krugler TransPac Software, Inc. <http://www.transpac.com> +1 530-470-9200

