On 25.07.08, Steve Litt wrote:
...

> I know nothing about unicode, so I'd need to be brought up to speed on that 
> before writing the program. 

References:

    * The Absolute Minimum Every Software Developer Absolutely,
      Positively Must Know About Unicode and Character Sets (No Excuses!)
      http://www.joelonsoftware.com/articles/Unicode.html

    * On the Goodness of Unicode
      http://www.tbray.org/ongoing/When/200x/2003/04/06/Unicode
    
    * Wikipedia article on Unicode
      http://en.wikipedia.org/wiki/Unicode

> I assume unicode is a 16 bit representation of characters.

Actually, unicode is a character <--> number mapping without an upper
limit to the numbers. 

Once upon a time, 16 bit where enough to represent all defined unicode
characters, but even then several different encodings into a computer
readable format existed. Programs that relied on unicode == 16 bit
(including LaTeX) have problems now with higher unicode numbers.

Günter

Reply via email to