On Thu, Dec 18, 2003 at 12:11:15PM +0330, Arash Bijanzadeh wrote: > I faced a problem with unicode and gotto go inside of this technoligy. I > noticed that for example when I am inserting a Arabic_khah with the unicode > 0f 062E two bytes with values D8 AE would be inserted to the file. Can > anybody tell me what is the relationship between these two value, and how > does it work.
D8 AE is the UTF-8 encoding of the Unicode character U+D8AE ("ARABIC LETTER KHAH"). UTF-8 is the de facto standard encoding of Unicode on Unix systems, partly due to its compatibility with US-ASCII. See RFC 2279 for the definition of UTF-8. Cheers, -- Colin Watson [EMAIL PROTECTED]