On Fri, 8 Nov 2024 at 08:48, Jim DeLaHunt via Unicode <[email protected]> wrote:
> If you want to understand how Linux behaves, ask
> Linux and the Linux source code.

Indeed. I've been a long-term user of Unicode with Linux (mostly Debian).
I do quite a lot of conversion between legacy codes, such as JIS, and
Unicode (usually in UTF-8 format). Virtually all my code conversion is
done using "iconv", which is available as a library routine and a
command-line utility. The source code is available; see:
https://www.gnu.org/software/libiconv/

On rare occasions, I need to dig into UTF-8 at the bit level. I have
a note pinned near my desk as an aide memoire. It has 3 lines:

UTF-8
zzzzyyyyyyxxxxxx
1110zzzz 10yyyyyy 10xxxxxx

Cheers

Jim

-- 
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/
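
For anyone who wants to try the bit layout in the note, here is a minimal Python sketch of the 3-byte UTF-8 form, which covers U+0800 to U+FFFF (excluding surrogates) and splits the code point into 4 + 6 + 6 bits. The function names encode_3byte and decode_3byte are made up for illustration; this is not how iconv implements it, and in practice the built-in codecs (or iconv) should do the work.

```python
def encode_3byte(cp: int) -> bytes:
    """Encode a code point in U+0800..U+FFFF as 3-byte UTF-8.

    Layout: zzzzyyyyyyxxxxxx -> 1110zzzz 10yyyyyy 10xxxxxx
    """
    if not (0x0800 <= cp <= 0xFFFF) or (0xD800 <= cp <= 0xDFFF):
        raise ValueError(f"U+{cp:04X} is not encodable as 3-byte UTF-8")
    z = cp >> 12           # top 4 bits
    y = (cp >> 6) & 0x3F   # middle 6 bits
    x = cp & 0x3F          # low 6 bits
    return bytes([0xE0 | z, 0x80 | y, 0x80 | x])


def decode_3byte(b: bytes) -> int:
    """Decode a 3-byte UTF-8 sequence back to its code point.

    Only the lead and continuation bit patterns are checked here;
    a production decoder would also reject overlong forms and surrogates.
    """
    if len(b) != 3 or (b[0] & 0xF0) != 0xE0:
        raise ValueError("not a 3-byte UTF-8 sequence")
    if (b[1] & 0xC0) != 0x80 or (b[2] & 0xC0) != 0x80:
        raise ValueError("bad continuation byte")
    return ((b[0] & 0x0F) << 12) | ((b[1] & 0x3F) << 6) | (b[2] & 0x3F)


if __name__ == "__main__":
    cp = ord("日")  # U+65E5
    encoded = encode_3byte(cp)
    print(encoded.hex(" "))                  # e6 97 a5
    print(encoded == "日".encode("utf-8"))   # True
```

Comparing against Python's own encoder, as in the last line, is a quick way to check the hand-rolled bit arithmetic.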
